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(57) Abstract 

This invention concerns new non-receptor protein tyrosine phosphatases of the hematopoietic stem cells (PTP HSC). The invention 
specifically concerns native murine and human PTP HSCs, their analogs in other mammals, and their functional derivatives. The invention 
further relates to nucleic acid encoding these proteins, vectors containing and capable of expressing such nucleic acid, and recombinant host 
ceils transformed with such nucleic acid. Assays for identifying agonists and antagonists of the native PTP HSCs, methods for expansion 
of undifferentiated stem cells, and methods for the induction of stem cell differentiation are also within the scope of the invention. 
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PROTEIN TYROSINE PHOSPHATASES OF HEMATOPOIETIC CELLS 

Field of the Invention 

The present invention concerns novel protein tyrosine phosphatases. More particularly, the invention 
concerns non-receptor protein tyrosine phosphatases of hematopoietic stem cells (PTP HSCs) 
5 Background of the Invention 

The ability of the hematopoietic stem cell to function as a source of committed progenitors throughout 
the lifetime of the organism is, at present, a poorly understood phenomenon. The major characteristic of the 
hematopoietic stem cell is its ability to self renew in the absence of differentiation (Morrison et aL, Ann. Rev 
Cell Pev, fliol., j_L 35-71 [1995]). This self renewal phenomenon is especially remarkable in light of the fact 
10 that the hematopoietic stroma, which is in close physical contact with the stem cell, is known to be a source that 
is rich in factors which mediate the growth and differentiation of hematopoietic progenitors (Deryugina and 
Muller-Sieberg, Crit. Rev, in Immunol. 12(2), 115-150 [1993]). For example, a recent PCR analysis of 
hematopoietically active endothelial ceil stromal lines derived from the murine yolk sac revealed that these cells 
produced a plethora of growth and differentiation factors including stem cell factor, FLT 3 ligand, M-CSF. LIF 
15 and 1L-6 (Fennie et q/., Blood 86(12), 4454-4467 [1995]). Such growth factors, in addition to many others, are 
known to induce the expansion and differentiation of stem cells, and these endothelial cell lines induced a rapid 
expansion and differentiation of embryonic hematopoietic stem cells along the myeloid pathway, although very 
early progenitor cells are also amplified by these stromal ceils (C. Fennie and L. Lasky-unpublished data). It has 
also been shown that incubation of highly purified stem cell populations in the presence of various purified 
20 hematopoietic growth factors induces differentiation with subsequent loss of the cells' ability to competitively 
repopuiate the hematopoietic compartment of lethally irradiated animals, consistent with the induction of terminal 
differentiation (Peters et al , Blo^d S20 ):. 30-37 [ 1 996]). Thus, the stem cell, whether in an embryonic or adult 
stromal environment, must maintain an undifferentiated state in spite of the fact that it is being exposed to a 
variety such maturation factors (Deryugina and Muller-Sieberg, supra). 
25 Although the hematopoietic growth factors are very diverse both structurally and functionally, they are 

all believed to play a role in mediating protein phosphorylation (Paulson and Bernstein, Semin Immunol 7(4), 
267-77 [1995]). This protein modification can occur via direct means, such as in the cases of the stem cell factor 
and FLT- 3 receptors, both of which have intrinsic tyrosine kinase activity, or via indirect means, as is the case 
of the hematopoietic/cytokine growth factor receptors for, for example, IL-3, EPO and TPO. in the case of the 
30 hematopoietic/cytokine growth factor receptors, tyrosine phosphorylation is indirectly accomplished by the 
activation of the JAK kinases, which occurs after growth factor mediated receptor dimerization (Ihle et ai , Annu. 
Rev. Immunol. 12, 369-398 [1995]), In both cases, diverse complex pathways of protein phosphorylation arc 
stimulated upon receptor binding. The intrinsic tyrosine kinase receptors mediate their signals via an elaborate 
series of tyrosine phosphorylation events which ultimately activate the RAS signaling pathway (Fantl et al. , Ann. 
35 Rev. Piochem. £2, 453-481 [1993]). This pathway eventually leads to the activation of the serine/threonine 
specific MAP kinase pathway which results in transcriptional activation. In contrast to this intricate pathway, 
hematopoietic growth factor-induced receptor dimerization mediates more direct activation events. Thus, the 
stimulation of the JAK kinases by receptor binding leads to the tyrosine phosphorylation and suhsequent 
dimerization of various STAT proteins. These activated STAT proteins than migrate to the nucleus, bind to 
STAT responsive sites in the nuclear DNA and induce transcription of differentiation and growth specific genes. 

- 1 - 
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Thus, a major effect of the growth factors produced by the hematopoietic stroma is to mediate the activation of 

various cellular pathways by protein phosphorylation. 

The regulation of protein tyrosine phosphorylation is accomplished by a balance between protein 

tyrosine kinases and protein tyrosine phosphatases (PTPs) (Walton and Dixon, Ann. Rev. Biochem. 62 . 1 0 1 - 1 20 

5 [1991]: Sun and Tonks. Trends Biochem. Sci. . 19(1 1). 480-485 [1994]). All PTPs contain a phosphatase domain 

including a subset of highly conserved amino acids, and a recent crystal structure analysis of PTP I B complexed 

with a tyrosine phosphorylated peptide revealed that many of these conserved residues are involved with 

substrate recognition and tyrosine dephosphorylation (Jia et ai, Science 268 (52 1 8), 1 754- 1 758 [ 1 995]). PTPs 

fall into two general categories: receptor type and non-receptor type. The receptor type PTPs have variously 

i 0 sized extracellular domains and, generally, two intracellular phosphatase domains Walton and Doxin. supra; Sun 
and Tonks, supra. The extracellular domains often contain a number of motifs that are generally utilized in cell 
adhesion including immunoglobulin domains and fibronectin-like regions . Many of these PTPs appear to 
function as homotypic and heterotypic sensors of the extracellular space, and they have been hypothesized to play 
roles in contact inhibition, ceil guidance and other intercellular functions (Brady-Kalnay and Tonks, Curr. Qpin. 

15 Cell. Biol. 7(5). 65-657 [1995]). The non-receptor PTPs are generally intracellular enzymes. They have various 
cellular localizations, depending upon the types of domains they contain, and some of the enzymes contain SH2 
motifs which allow them to interact intimately with phosphotyrosine residues. While many of the non-receptor 
PTPs are in various cytoplasmic locations, a small number of these enzymes are found in the nucleus (Flores et 
al. Mol. CelL Biol. 14(71 4938-46 [1994]). Many non-receptor PTPs appear to function as both activators as 

20 well as inhibitors of diverse tyrosine phosphorylated proteins. A subset appear to play important roles in 
hematopoiesis. For example, the motheaten mouse, which has a phenotype of lethal myeloid amplification and 
inflammation, has been found to have a mutation in the PTP 1C gene (Schulz et ai. y CeH 73(7), 1445-54 [1993]); 
(McCulloch and Siminovitch, Adv. Exp. Med. Biol. 365, 145-54 [1994]). In addition, the level of tyrosine 
phosphorylation of the EPO receptor, as well as the level of receptor activation, appears to be in part controlled 

25 by the PTP 1C enzyme as well (Klingmuller et ai, Ce_U£Q(5), 729-38 [1995]). However, while these examples, 
as well as others, highlight the potential importance of the PTPs, very little is known regarding the physiological 
importance of these enzymes. 

Summary of the Invention 

We have hypothesized that one mechanism by which the undifferentiated state of the stem cell might 
30 be maintained is by the dephosphorylation of tyrosine phosphorylated proteins by PTPs. In order to examine 
this possibility, we have analyzed a large number of PTPs from a very primitive embryonic hematopoietic cell 
population using consensus PCR. From this population we have cloned a novel intracellular PTP which has 
many of the characteristics, including down-regulation of the transcript as the hematopoietic stem cells 
differentiate, which might be expected from a PTP involved with the control of differentiation signals such as 
35 those induced by hematopoetic growth factors. We have designated this novel PTP as the "PTP of hematopoietic 
stem cells", which will be referred to hereafter as "PTP HSC. M 

Accordingly, the present invention concerns an isolated non-receptor protein tyrosine phosphatase of 
hematopoietic stem cells (PTP HSC), which 

(1) is expressed predominantly in early hematopoietic stem/progenitor cells; 
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(2) predominantly lacks expression in adult tissues; 

(3) comprises an N-terminal tyrosine phosphatase domain, followed by a region rich in serine, 
threonine, and proline, and a carboxy terminal region of about 15 to 25 amino acids rich in basic amino acid 
residues; and 

5 (4) is capable of tyrosine dephosphorylation in hematopoietic stem cells or progenitor cells. 

This novel PTP preferably downregulates STAT activation. A preferred group of the PTP HSC proteins 
of the present invention includes a protein comprising the amino acid sequence shown in Figure 1 (SEQ. ID. 
NO:2); a protein comprising the amino acid sequence shown in Figure 8 (SEQ. ID. NO: 17), a further 
mammalian homologue of either protein; and derivatives of the foregoing proteins retaining the ability of 
10 tyrosine dephosphorylation in hematopoietic stem cells or progenitor cells. £. 

The PTP HSCs, including derivatives (e.g. amino acid sequence variants) of the native proteins, 
preferably have an active N-terminal tyrosine phosphatase domain, retaining a serine residue at a position 
corresponding to amino acid position 37 in Figure 1 , and retaining an active site cysteine residue at a position 
corresponding to amino acid position 229 in Figure 1, a region rich in serine, threonine, and proline, and a 
15 carboxy- terminal region showing at least about 80% sequence homology with the amino acid sequence between 
positions 430 and 45 1 in Figure 1 . Most preferably, such derivatives have at least about 65% overall sequence 
homology with the amino acid sequence shown in Figure 1 or Figure 8 and retain the ability- of tyrosine - 
dephosphorylation in hematopoietic stem cells or progenitor cells. r ^. 

In another aspect, the present invention concerns agonists and antagonists of PTP HSCs. 
20 In yet another aspect, the invention concerns isolated nucleic acid molecules encoding the PTP HSCs 

herein. 

In a further aspect, the invention concerns vectors comprising nucleic acid encoding the PTP HSCs 

herein, operably linked to control sequences recognized by a host cell transformed with the vector, and to cells 

transformed with such vectors. 
25 In a still further aspect of the present invention, there are provided antibodies capable of specific 

binding to the PTP HSCs of this invention, and hybridoma cell lines producing such antibodies. The antibodies 

may be agonist antibodies, which stimulate the ability of the native PTP HSCs of the present invention to 

dephosphorylate tyrosines, or antagonist antibodies, which block this activity. 

The present invention further concerns an assay for identifying an antagonist or an agonist of a PTP 
30 HSC of the present invention, which comprises contacting the phosphatase domain of the PTP HSC with a 

candidate antagonist or agonist, and monitoring the ability of the phosphatase domain to dephosphorylate 

tyrosine residues. 

In another embodiment, the invention concerns an assay for identifying an antagonist or agonist of a 
PTP HSC of the present invention by cultivating a PTP HSC-expressing hematopoietic stem cell line or 
35 progenitor cell line in the presence of a candidate antagonist or agonist, and monitoring the differentiation of the 
progenitor cells. 

The invention further concerns a method for the differentiation of undifferentiated malignant 
hemopoietic (e.g. leukemia) cells, comprising contacting said cells with an antagonist of a PTP HSC of the 
present invention. 
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In an additional aspect, the invention concerns a method for the induction of hematopoietic stem cell 

differentiation, comprising contacting said stem cells with an antagonist of a PTP HSC of the present invention. 

In another aspect, the invention concerns a method for expansion undifferentiated hematopoietic stems 

cells in cell culture, comprising cultivating stem cells in the presence of a PTP HSC of the present invention or 

5 an agonist antibody specifically binding a native PTP HSC. 

In yet another aspect, the invention concerns a method for the expansion of undifferentiated stem cells 

in vivo comprising administering to a patient an agonist of PTP HSC of the present invention or an agonist 

antibody specifically binding a native PTP HSC, and a stem cell growth factor. 

Brief Description of the Praw^gs 

10 Figure I. DNA and deduced protein sequence of the murine PTP HSC cDNA. Illustrated is the 

DNA sequence (SEQ. ID. NO: 1) and deduced protein sequence (SEQ. ID. NO: 2) of the murine PTP HSC 
cDNA. The overlined region is the phosphatase homologous domain. The asterisk denotes the active site 
cysteine residue. The P,S,T-rich region is illustrated by boxes around these residues. The shaded carboxy 
terminal region is homologous to a nuclear localization signal found on murine PTP PEP (Flores et al, Mol. Cell. 

15 Biol. 14(7). 4938-46 [1994]). 

Figure 2. Sequence homologies of murine PTP HSC, murine PTP PEP, and human PTP PEST. 
A. The phosphatase domain homologies show that these three proteins are highly related to each other. A star 
over the residue (amino acid 37 of PTP HSC) illustrates a conserved serine that is phosphorylated by protein 
kinases A and C and which appears to negatively regulate PTPase activity (Garton and Tonks, EMBO J. 13( 16), 

20 3763-71 [1994]). The amino acid sequence of positions 24-301 of PTP PEP is shown in SEQ. ID. NO: 18; the 
amino acid sequence of positions 24 - 299 of PTP PEST is shown in SEQ. ID. NO: 19. B. A second highly 
homologous region is found at the carboxy terminus of these three proteins (SEQ. ID. NO: 22 showing amino 
acids 783 - 803 of PTP PEP; SEQ. ID. NO: 23 showing amino acids 761 - 781 of PTP PEST). This region has 
been shown to confer nuclear localization on PTP PEP. Interestingly PTP PEST is localized to the cytoplasm, 

25 and it has been hypothesized that this is due to the two negatively charged residues shown by the arrows. As can 
be seen, PTP HSC also contains these negatively charged residues, suggesting that it is also localized to the 
cytoplasm. 

Figure 3. The PTP PST family. Illustrated are the three so far identified members of this family 
including the currently described novel PTP (PTP HSC). Shown are the amino terminal PTP domains (black), 
30 the P,S,T rich domains, and the carboxy terminal nuclear localization homology (shaded). 

Figure 4. Intron sites superimposed on the PTP HSC domain structure. Analysis of the gene 
encoding PTP HSC revealed the location of 14 introns that are shown as triangles in this figure. 

Figure 5. In vitro tyrosine phosphatase activity of the PTP HSC. Shown is the enzymatic activity 
obtained using isolated, bacterially produced GST-phosphatase domain of PTP HSC. Black squares, serial 
35 dilutions of GST-PTP HSC in the absence of ortho vanadate; white squares, enzymatic activity of GST-PTP HSC 
in the presence of vanadate; closed circle, enzymatic activity of GST alone; open circles enzymatic activity with 
an inactive GST-PTP (J. Cheng and L. Lasky-unpublished data). The initial undiluted reaction contained 2 ug 
of each protein. 
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Figure 6. PCR analysis of PTP HSC expression. A. lin lo CD34 hl sca hl or lin lo CD34 hl sca , ° 
hematopoietic progenitor cells were isolated from murine embryos at day 1 1 of development. RNA was isolated 
and analyzed by quantitative PCR. The upper band corresponds to the PTP HSC transcript while the lower band 
corresponds to the triose phosphate isomerase (TPI) internal standard. B. lin lo CD34 ni sca hl hematopoietic 
5 progenitor/stem cells were purified from murine fetal liver and incubated for up to 14 days in IL-s, IL-s, EPO 
and GM-CSF. RNA was isolated at various times and analyzed by quantitative PCR as described in A. 

Figure 7. PTP HSC Transcript analysis in embryonic and adult tissues and hematopoietic cell 
lines. A. Illustrated is a tissue northern blot probed with a cDNA encoding PTP HSC. The left panel illustrates 
RNA isolated from variously aged embryos, while the right panel illustrates RNA isolated from: a. heart, b., 
10 brain, c. spleen, d. lung, e. liver, f. skeletal muscle, g. kidney, h. testis. B. Illustrated is a northern blot of RNA 
isolated from BAF 3 (a), 32D (b) and FDCP (c) hematopoietic progenitor cells. Also shown is the ethidium 
bromide stain of the same gel prior to transfer. C, PCR analysis of RNA isolated from BAF 3 (a), 32 D (b), T 
cell clone (c), FDCP (d), 1 1 day embryos (e) and a control with no reverse transcriptase (f) 

Figure 8. Partial DNA and deduced protein sequence of the human PTP HSC cDNA. Illustrated 
15 is the partial DNA sequence (SEQ. ID. NO: 1 7 ) and deduced protein sequence (SEQ. ID. NO: 1 8) of the human 
PTPHSCcDNA. 

Detailed Description of the Invention 

A. Definitions 

The phrases "non-receptor protein tyrosine phosphatase of hematopoietic stem cells".* "tyrosine 
20 phosphatase of hematopoietic stem cells" and "PTP HSC" are used interchangeably and refer to a native 
intracellular protein tyrosine phosphatase which (1) is expressed predominantly in early hematopoietic stem and 
progenitor cells; (2) predominantly lacks expression in adult tissues; (3) comprises an N-terminal 

tyrosine phosphatase domain, followed by a region rich in serine, threonine, and proline, and a carboxy terminal 
region of about 15 to 25 amino acids rich in basic amino acid residues; and (4) is capable of tyrosine 
25 dephosphorylation in hematopoietic progenitor cells, and functional derivatives of such native tyrosine 
phosphatase. 

The term "native tyrosine phosphatase" in this context refers to a naturally occurring tyrosine 
phosphatase, having the described properties, of any human or non-human animal species, with or without the 
initiating methionine, whether purified from native source, synthesized, produced by recombinant DNA 

30 technology or by any combination of these and/or other methods. Native PTP HSCs specifically include the 
native murine and native human HSC proteins (SEQ. ID. NOs: 2 and , respectively). 

A "functional derivative" of a polypeptide is a compound having a qualitative biological activity in 
common with the native polypeptide. Thus, a functional derivative of a native PTP HSC polypeptide is a 
compound that has a qualitative biological activity in common with a native PTP HSC. "Functional derivatives" 

35 include, but are not limited to, fragments of native polypeptides from any animal species (including humans), 
derivatives of native (human and non-human) polypeptides and their fragments, and peptide and non-peptide 
analogs of native polypeptides, provided that they have a biological activity in common with a respective native 
polypeptide. "Fragments" comprise regions within the sequence of a mature native polypeptide. The term 
"derivative" is used to define amino acid sequence variants, and covalent modifications of a native polypeptide. 
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"Non-peptide analogs" are organic compounds which display substantially the same surface as peptide analogs 

of the native polypeptides. Thus, the non-peptide analogs of the native PTP HSCs of the present invention are 

organic compounds which display substantially the same surface as peptide analogs of the native PTP HSCs. 

Such compounds interact with other molecules in a similar fashion as the peptide analogs, and mimic a biological 

5 activity of a native PTP HSC of the present invention. The polypeptide functional derivatives of the native PTP 

HSCs of the present invention preferably have an active N-terminal tyrosine phosphatase domain, retaining a 

serine residue at a position corresponding to amino acid position 37 in Figure 1, and retaining an active site 

cysteine residue at a position corresponding to amino acid position 229 in Figure 1; a region rich in serine, 

threonine, and proline; and a carboxy-terminal region showing at least about 80% sequence homology with the 

10 amino acid sequence between positions 430 and 45 1 in Figure 1 . Preferably, such derivatives have at least about 

65%. more preferably at least about 75 %, even more preferably at least about 85%, most preferably at least 

about 95% overall sequence homology with the amino acid sequence shown in Figure 1 (SEQ. ID. NO: 2) or 

Figure 8 (SEQ. ID. NO: 18) and retain the ability of tyrosine dephosphorylation in hematopoietic progenitor 

ceils. 

1 5 The term "biological activity" in the context of the definition of functional derivatives is defined as the 

possession of at least one adhesive, regulatory or effector function qualitatively in common with a native 
polypeptide (e.g. PTP HSC). The functional derivatives of the native PTP HSCs of the present invention are 
unified by their qualitative ability of tyrosine dephosphorylation in hematopoietic progenitor cells. In addition, 
the functional derivatives of the native PTP HSCs herein preferably are capable of downregulating STAT 

20 activation. 

The term "agonist" is used to refer to peptide and non-peptide analogs of the native PTP HSCs of the 
present invention and to antibodies specifically binding such native PTP HSCs provided that they retain the 
qualitative ability of tyrosine dephosphorylation in hematopoietic progenitor cells. 

The term "antagonist" is used to refer to a molecule inhibiting the ability of a PTP HSC of the present 
25 invention to dephosphorylate tyrosines. Preferred antagonists essentially completely block tyrosine 
dephosphorylation caused by a PTP HSC. 

"Identity" or "homology" with respect to a native polypeptide and its functional derivative is defined 
herein as the percentage of amino acid residues in the candidate sequence that are identical with the residues of 
a corresponding native polypeptide, after aligning the sequences and introducing gaps, if necessary, to achieve 
30 the maximum percent homology, and not considering any conservative substitutions as part of the sequence 
identity. Neither N- or C-terminal extensions nor insertions shall be construed as reducing identity or homology. 
Methods and computer programs for the alignment are well known in the art. 

The term "stem cell" is used in the broadest sense to describe cells which are not terminally 
differentiated and have the ability to divide throughout the lifetime of the organism, yielding some progeny that 
35 differentiate and others that remain stem cells, including stem cells of any tissue type, such as the lining of the 
gut. the epidermal layer of the skin and the blood-forming tissues. 

The term "hematopoietic stem cell*' is used in the broadest sense to refer to stem cells from which blood 
cells derive, including pluripotent stem cells, lymphoid and myeloid stem cells. 
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The term "hematopoietic progenitor cell" refers to the progeny of a pluripotent hematopoietic stem cell 

which are committed for a particular line of differentiation. These committed progenitor cells are irreversibly 

determined as ancestors of only one or a few blood cell types, e.g. erythrocytes or granulocytes. 

"Hematopoietic growth factors" are growth factors that influence blood cell formation or differentiation 
5 in vivo, such as EPO, TPO, IL-3, 1L-6, stem cell growth factor, M-CSF, G-CSF, GM-CSF, FTL 3 ligand, LIF, 
etc., unified by their role in mediating protein phosphorylation. The receptors of these growth factors are either 
transmembrane tyrosine kinases or are members of the cytokine receptor family. 

Ordinarily, the terms "amino acid" and "amino acids" refer to all naturally occurring L-a-amino acids. 
In some embodiments, however, D-amino acids may be present in the polypeptides or peptides of the present 
10 invention in order to facilitate conformational restriction. For example, in order to facilitate disulfide bond 
formation and stability, a D amino acid cysteine may be provided at one or both termini of a peptide functional 
derivative or peptide antagonist of the native PTP HSC's of the present invention. The amino acids are identified 
by either the single-letter or three-letter designations: 



Asp 


D 


aspartic acid 


lie 


1 


isoleucine 




15 Thr 


T 


threonine 


Leu 


L 


leucine 




Ser 


S 


serine 


Tyr 


Y 


tyrosine 




Glu 


E 


glutamic acid 


Phe 


F 


phenylalanine 




Pro 


P 


proline 


His 


H 


histidine 




Gly 


G 


glycine 


Lys 


K 


lysine 




20 Ala 


A 


alanine 


Arg 


R 


arginine 




Cys 


C 


cysteine 


Trp 


W 


tryptophan 




Val 


V 


valine 


Gin 


0 


glutamine 




Met 


M 


methionine 


Asn 


N 


asparagine 





These amino acids may be classified according to the chemical composition and properties of their side 
25 chains. They are broadly classified into two groups, charged and uncharged. Each of these groups is divided 
into subgroups to classify the amino acids more accurately: 
I. Charged Amino Acids 

Acidic Residues : aspartic acid, glutamic acid 
Basic Residues : lysine, arginine, histidine 
30 1L Uncharged Amino Acids 

Hydrophilic Rescue?: serine, threonine, asparagine, glutamine 
Aliphatic Residues : glycine, alanine, valine, leucine, isoleucine 
Non-polar Residues : cysteine, methionine, proline 
Aromatic Residues: phenylalanine, tyrosine, tryptophan 
35 The term "amino acid sequence variant" refers to molecules with some differences in their amino acid 

sequences as compared to a native amino acid sequence. 



-7- 
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Substitutional variants are those that have at least one amino acid residue in a native sequence removed 

and a different amino acid inserted in its place at the same position. The substitutions may be single, where onl> 

one amino acid in the molecule has been substituted, or they may be multiple, where two or more amino acids 

have been substituted in the same molecule. 

5 Insertional variants are those with one or more amino acids inserted immediately adjacent to an amino 

acid at a particular position in a native sequence. Immediately adjacent to an amino acid means connected to 

either the cr-carboxy or a-amino functional group of the amino acid. 

Deletional variants are those with one or more amino acids in the native amino acid sequence removed. 

Ordinarily, deletional variants will have one or two amino acids deleted in a particular region of the molecule. 

10 "Antibodies (Abs) M and "immunoglobulins (lgs)" are glycoproteins having the same structural 

characteristics. While antibodies exhibit binding specificity to a specific antigen, immunoglobulins include both 
antibodies and other antibody-like molecules which lack antigen specificity. Polypeptides of the latter kind are, 
for example, produced at low levels by the lymph system and at increased levels by myelomas. 

Native antibodies and immunoglobulins are usually heterotetrameric glycoproteins of about 150.000 

1 5 daltons, composed of two identical light (L) chains and two identical heavy (H) chains. Each light chain is linked 
to a heavy chain by one covalent disulfide bond, while the number of disulfide linkages varies between the heavy 
chains of different immunoglobulin isotypes. Each heavy and light chain also has regularly spaced intrachain 
disulfide bridges. Each heavy chain has at one end a variable domain (V|_j) followed by a number of constant 
domains. Each light chain has a variable domain at one and (V^) and a constant domain at its other end; the 

20 constant domain of the light chain is aligned with the first constant domain of the heavy chain, and the light chain 
variable domain is aligned with the variable domain of the heavy chain. Particular amino acid residues are 
believed to form an interface between the light and heavy chain variable domains (Clothia et aL, J. Mol. Biol. 
18£, 65 1 -663 [ 1 985]; Novotny and Haber, Proc. Natl. Acad. Sci. USA 82, 4592-4596 [ 1 985]). 

The term "variable" refers to the fact that certain portions of the variable domains differ extensively in 

25 sequence among antibodies and are used in the binding and specificity of each particular antibody for its 
particular antigen. However, the variability is not evenly distributed through the variable domains of antibodies. 
It is concentrated in three segments called complementarity determining regions (CDRs) or hypervariable regions 
both in the light chain and the heavy chain variable domains. The more highly conserved portions of variable 
domains are called the framework (FR). The variable domains of native heavy and light chains each comprise 

30 four FR regions, largely adopting a Q-sheet configuration, connected by three CDRs, which form loops 
connecting, and in some cases forming pan of, the p-sheet structure. The CDRs in each chain are held together 
in close proximity by the FR regions and, with the CDRs from the other chain, contribute to the formation of the 
antigen binding site of antibodies (see Kabat, E-A. et at., Sequences of Proteins of Immunological Interest . 
National Institute of Health, Bethesda, MD [1991]). The constant domains are not involved directly in binding 

35 an antibody to an antigen, but exhibit various effector functions, such as participation of the antibody in antibody- 
dependent cellular toxicity. 

Papain digestion of antibodies produces two identical antigen binding fragments, called Fab fragments, 
each with a single antigen binding site, and a residual "Fc" fragment, whose name reflects its ability to crystallize 
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readily. Pepsin treatment yields an F(ab')-) fragment that has two antigen combining sites and is still capable of 

cross-linking antigen. 

"Fv" is the minimum antibody fragment which contains a complete antigen recognition and binding site. 
This region consists of a dimer of one heavy and one light chain variable domain in tight, non-covalent 
5 association. It is in this configuration that the three CDRs of each variable domain interact to define an antigen 
binding site on the surface of the V^-V^ dimer. Collectively, the six CDRs confer antigen binding specificity 
to the antibody. However, even a single variable domain (or half of an Fv comprising only three CDRs specific 
for an antigen) has the ability to recognize and bind antigen, although at a lower affinity than the entire binding 
site. 

10 The Fab fragment also contains the constant domain of the light chain and the first constant domain 

(CH 1 ) of the heavy chain. Fab' fragments differ from Fab fragments by the addition of a few residues at the 
carboxy terminus of the heavy chain CHI domain including one or more cysteines from the antibody hinge 
region. Fab'-SH is the designation herein for Fab' in which the cysteine residue(s) of the constant domains bear 
a free thiol group. F(ab'>7 antibody fragments originally were produced as pairs of Fab* fragments which have 

1 5 hinge cysteines between them. Other, chemical couplings of antibody fragments are also known. 

The light chains of antibodies (immunoglobulins) from any vertebrate species can be assigned to one 
of two clearly distinct types, called kappa and lambda (A.), based on the amino acid sequences of their constant 
domains. - 

Depending on the amino acid sequence of the constant domain of their heavy chains, immunoglobulins 

20 can be assigned to different classes. There are five major classes of immunoglobulins: IgA, IgD, IgE, IgG and 
IgM, and several of these may be further divided into subclasses (isotypes), e.g. IgG- 1, IgG-2, IgG-3, and lgG-4; 
IgA- 1 and lgA-2. The heavy chain constant domains that correspond to the different classes of immunoglobulins 
are called a, delta, epsilon, y y and u, respectively. The subunit structures and three-dimensional configurations 
of different classes of immunoglobulins are well known. r 

25 The term "antibody" is used in the broadest sense and specifically covers single monoclonal antibodies 

(including agonist and antagonist antibodies), antibody compositions with polyepitopic specificity, as well as 
antibody fragments (e.g., Fab, F(ab')2» and Fv), so long as they exhibit the desired biological activity. 

The term "monoclonal antibody" as used herein refers to an antibody obtained from a population of 
substantially homogeneous antibodies, i.e., the individual antibodies comprising the population are identical 

30 except for possible naturally occurring mutations that may be present in minor amounts. Monoclonal antibodies 
are highly specific, being directed against a single antigenic site. Furthermore, in contrast to conventional 
(polyclonal) antibody preparations which typically include different antibodies directed against different 
determinants (epitopes), each monoclonal antibody is directed against a single determinant on the antigen. In 
addition to their specificity, the monoclonal antibodies are advantageous in that they are synthesized by the 

35 hybridoma culture, uncontaminated by other immunoglobulins. The modifier "monoclonal" indicates the 
character of the antibody as being obtained from a substantially homogeneous population of antibodies, and is 
not to be construed as requiring production of the antibody by any particular method. For example, the 
monoclonal antibodies to be used in accordance with the present invention may be made by the hybridoma 
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method first described by Kohler & Milstein, Nature 256:495 (1975). or may be made by recombinant DNA 

methods [see, e.g. U.S. Patent No. 4,816,567 (Cabilly et ai)). 

The monoclonal antibodies herein specifically include "chimeric" antibodies (immunoglobulins) in 

which a portion of the heavy and/or light chain is identical with or homologous to corresponding sequences in 

5 antibodies derived from a particular species or belonging to a particular antibody class or subclass, while the 

remainder of the chain(s) is identical with or homologous to corresponding sequences in antibodies derived from 

another species or belonging to another antibody class or subclass, as well as fragments of such antibodies, so 

long as they exhibit the desired biological activity (U.S. Patent No. 4,8 1 6,567 (Cabilly et ai ; Morrison et ai . 

Proc, Nfrtl, fort* Sci- VSA 81, 685 1 -6855 [ 1 984]). 

10 "Humanized" forms of non-human (e.g. murine) antibodies are chimeric immunoglobulins, 

immunoglobulin chains or fragments thereof (such as Fv, Fab, Fab', F(ab% or other antigen-binding 
subsequences of antibodies) which contain minimal sequence derived from non-human immunoglobulin. For 
the most part, humanized antibodies are human immunoglobulins (recipient antibody) in which residues from 
a complementary determining region (CDR) of the recipient are replaced by residues from a CDR of a non- 

1 5 human species (donor antibody) such as mouse, rat or rabbit having the desired specificity, affinity and capacity. 
In some instances, Fv framework residues of the human immunoglobulin are replaced by corresponding non- 
human residues. Furthermore, humanized antibody may comprise residues which are found neither in the 
recipient antibody nor in the imported CDR or framework sequences. These modifications are made to further 
refine and optimize antibody performance. In general, the humanized antibody will comprise substantially all 

20 of at least one, and typically two, variable domains, in which all or substantially all of the CDR regions 
correspond to those of a non-human immunoglobulin and all or substantially all of the FR regions are those of 
a human immunoglobulin consensus sequence. The humanized antibody optimally also will comprise at least 
a portion of an immunoglobulin constant region (Fc), typically that of a human immunoglobulin. For further 
details see: Jones et ai . Nature 321 . 522-525 [1986]; Reichmann et ai. Nature 332 . 323-329 [1988]; EP-B-239 

25 400 published 30 September 1987; Presta, Curr. Op. Struct. Biol. 2 593-596 [1992]; and EP-B-451 216 
published 24 January 1996). 

In the context of the present invention the expressions "cell", "cell line", and "cell culture" are used 
interchangeably, and all such designations include progeny. It is also understood that all progeny may not be 
precisely identical in DNA content, due to deliberate or inadvertent mutations. Mutant progeny that have the 

30 same function or biological property, as screened for in the originally transformed cell, are included. 

The terms "replicable expression vector" and "expression vector" refer to a piece of DNA, usually 
double-stranded, which may have inserted into it a piece of foreign DNA. Foreign DNA is defined as 
heterologous DNA, which is DNA not naturally found in the host cell. The vector is used to transport the foreign 
or heterologous DNA into a suitable host cell. Once in the host cell, the vector can replicate independently of 

35 the host chromosomal DNA, and several copies of the vector and its inserted (foreign) DNA may be generated. 
In addition, the vector contains the necessary elements that permit translating the foreign DNA into a 
polypeptide. Many molecules of the polypeptide encoded by the foreign DNA can thus be rapidly synthesized. 

"Oligonucleotides" are short-length, single- or double-stranded polydeoxynuclcotides that are 
chemically synthesized by known methods [such as phosphotriester, phosphite, or phosphoramidite chemistry. 
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using solid phase techniques such as those described in EP 266,032, published 4 May 1988, or via 

deoxynucieoside H-phosphanate intermediates as described by Froehler et aL. Nucl. Acids Res. 14. 5399 (1986). 
They are then purified on polyacrylamide gels. 

B. Production of PTP HSCs by recombinant DNA technology 
5 1. Identification and isolation of nucleic acid encoding PTP HSCs 

The native PTP HSC proteins of the present invention may be isolated from relatively undifferentiated, 
early hematopoietic stem or progenitor cells. The isolation of murine PTP HSC from the CD34* 1 ' fraction of 
murine 10.5 day yolk sac or embryo cells is illustrated in the examples. Similarly, murine PTP HSC can be 
isolated from CD34 m population originated from bone marrow or fetal liver. The purity of these murine cells 

10 was found to be a critical step in isolating the mRMA encoding the new murine PTP HSC of the present 
invention. A high degree of purity was achieved by purification with a rabbit anti-murine CD34 antibody 
followed by a lineage depletion step and a positive selection step with the Sea antibody. Alternatively, murine 
PTP HSC can be detected and obtained from other relatively undifferentiated precursors of mature murine 
hematopoietic cells, such as, BAF 3, 32D and FDCP hematopoietic progenitor cells, available from the American 

1 5 Type Culture Collection (ATCC). Native human PTP HSC can, for example, be identified in and obtained from 
human CMK progenitor cells. As the PTP HSCs enzymes have an extremely low abundance in embryonic 
tissues, their purification by traditional methods would be very cumbersome and inefficient. Instead, cDNA or 
genomic clones encoding the PTP HSC proteins of the present invention can be prepared using standard 
techniques of recombinant DNA technology. For example, cDNA library can be constructed by obtaining 

20 polyadenylated mRNA from a cell line known to express the desired PTP HSC, and using the mRNA as a 
template to synthesize double stranded cDNA. Exemplary human and non-human cell lines suitable for this 
purpose have been listed hereinabove. A PTP HSC polypeptide gene can also be obtained fronra genomic 
library, such as a human genomic cosmid library. 

Libraries, either cDNA or genomic, are then screened with probes designed to identify ,the gene of 

25 interest or the protein encoded by it. For cDNA expression libraries, suitable probes include monoclonal and 
polyclonal antibodies that recognize and specifically bind to a PTP HSC polypeptide. For cDNA libraries, 
suitable probes include carefully selected oligonucleotide probes (usually of about 20-80 bases in length) that 
encode known or suspected portions of a PTP HSC polypeptide from the same or different species, and/or 
complementary or homologous cDNAs or fragments thereof that encode the same or a similar gene. Appropriate 

30 probes for screening genomic DNA libraries include, without limitation, oligonucleotides, cDNAs, or fragments 
thereof that encode the same or a similar gene, and/or homologous genomic DNAs or fragments thereof. 
Screening the cDN A or genomic library with the selected probe may be conducted using standard procedures 
as described in Chapters 1 0- 1 2 of Sambrook etal. Molecular Cloning: A Laboratory Manual . New York. Cold 
Spring Harbor Laboratory Press, 1989. 

35 If DNA encoding an enzyme of the present invention is isolated by using carefully selected 

oligonucleotide sequences to screen cDNA libraries from various tissues, the oligonucleotide sequences selected 
as probes should be sufficient in length and sufficiently unambiguous that false positives are minimized. The 
actual nucleotide sequence(s) is/are usually designed based on regions which have the least codon redundance. 
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The oligonucleotides may be degenerate at one or more positions. The use of degenerate oligonucleotides is of 

particular importance where a library is screened from a species in which preferential codon usage is not known 

The oligonucleotide must be labeled such that it can be detected upon hybridization to DNA in the 

library being screened. The preferred method of labeling is to use ATP (e.g., y^P) and polynucleotide kinase 

5 to radiolabel the 5* end of the oligonucleotide. However, other methods may be used to label the oligonucleotide, 

including, but not limited to, biotinylation or enzyme labeling. 

cDNAs encoding PTP HSCs can also be identified and isolated by other known techniques of 

recombinant DNA technology, such as by direct expression cloning, or by using the polymerase chain reaction 

(PGR) as described in U.S. Patent No. 4,683,195, issued 28 July 1987, in section 14 of Sambrook et aL supra, 

10 or in Chapter 15 of Current Protocols in Molecular Biolog y. Ausubel et al. eds., Greene Publishing Associates 

and Wiley-Interscience 1991 . The use of the PCR technique for obtaining cDNA encoding murine PTP HSC 

or the PTP domain of this native protein is also illustrated in the examples. 

Once cDNA encoding a PTP HSC enzyme from one species has been isolated, cDNAs from other 

species can also be obtained by cross-species hybridization. According to this approach, human or other 

1 5 mammalian cDNA or genomic libraries are probed by labeled oligonucleotide sequences selected from known 

PTP HSC sequences (such as murine PTP HSC) in accord with known criteria, among which is that the sequence 

should be sufficient in length and sufficiently unambiguous that false positives are minimized. Typically, a ^P- 

labeled oligonucleotide having about 30 to 50 bases is sufficient, particularly if the oligonucleotide contains one 

or more codons for methionine or tryptophan. Isolated nucleic acid will be DNA that is identified and separated 

20 from contaminant nucleic acid encoding other polypeptides from the source of nucleic acid. Hybridization is 

preferably performed under "stringent conditions" which means (1) employing low ionic strength and hgh 

temperature for washing, for example, 0.015 sodium chloride/0.0015 M sodium citrate/0. 1 % sodium dodecy! 

sulfate at 50 ° C, or (2) employing during hybridization a denaturing agent, such as formamide, for example, 50% 

(vol/vol) formamide with 0.1% bovine serum albumin/0.1% polyvinylpyrrolidone/50 nM sodium phosphate 

25 buffer at pH 6.5 with 650 mM sodium chloride, 75 mM sodium citrate at 42 °C. Another example is the use of 

5)% formamide, 5 x SSC (0.75 M sodium chloride, 0.075 M sodium citrate), 50 mM sodium phosphate (pH 6.8), 

0. 1% sodium pyrophosphate, 5 x Denhardt's solution, sonicated salmon sperm DNA (50 ug/ml), 0. 1% SDS. and 

10% dextran sulfate at 42 °C, with washes at 42 °C in 0.2 x SSC and 0.1% SDS. 

Once the sequence is known, the gene encoding a particular PTP HSC polypeptide can also be obtained 

30 by chemical synthesis, following one of the methods described in Engels and Uhlmann, Agnew. Chem. Int. Ed. 

Engl. 28. 716 (1989). These methods include triester, phosphite, phosphoramidite and H-phosphonate methods. 

PCR and other autoprimer methods, and oligonucleotide syntheses on solid supports. 

2. Cloning and expression of nucleic acid encoding FTP HSCs 

Once the nucleic acid encoding PTP HSC is available, it is generally ligated into a replicable expression 

35 vector for further cloning (amplification of the DNA), or for expression. 

Expression and cloning vectors are well known in the art and contain a nucleic acid sequence that 

enables the vector to replicate in one or more selected host cells. The selection of the appropriate vector will 

depend on I ) whether it is to be used for DNA amplification or for DNA expression. 2) the size of the DNA to 

be inserted into the vector, and 3) the host cell to be transformed with the vector. Each vector contains various 
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components depending on its function (amplification of DNA of expression of DNA) and the host cell for which 

it is compatible. The vector components generally include, but are not limited to, one or more of the following: 

a signal sequence, an origin of replication, one or more marker genes, an enhancer element, a promoter, and a 

transcription termination sequence. Construction of suitable vectors containing one or more of the above listed 

5 components, the desired coding and control sequences, employs standard ligation techniques. Isolated plasm ids 

or DNA fragments are cleaved, tailored, and religated in the form desired to generate the plasmids required. For 

analysis to confirm correct sequences in plasmids constructed, the ligation mixtures are commonly used to 

transform E. coli cells, e.g. E. coli KI2 strain 294 (ATCC 3 1 ,446) and successful transformants selected by 

ampicillin or tetracycline resistance where appropriate. Plasmids from the transformants are prepared, analyzed 

1 0 by restriction endonuclease digestion, and/or sequenced by the method of Messing et ai , Nucleic Acids Res. 9. ^ 
309 ( 1 98 1 ) or by the method of Maxam et ai , Methods in Enzvmologv 65, 499 ( 1 980). i 

The polypeptides of the present invention may be expressed in a variety of prokaryotic and eukaryotic 
host cells. Suitable prokaryotes include gram negative or gram positive organisms, for example £. coli or bacilli. 
A preferred cloning host is £. coli 294 (ATCC 31,446) although other gram negative or gram positive 

1 5 prokaryotes such as £, £oJi B, £. £oJi X 1 776 (ATCC 3 1 ,537), E. coJi W3 1 10 (ATCC 27,325), Pseudomonas 
species, or Serratia Marcesans are suitable. 

In addition to prokaryotes, eukaryotic microbes such as filamentous fungi or yeast are suitable hosts for 
vectors herein. Saccharomvces cerevisiae . or common baker's yeast, is the most commonly used among lower 
eukaryotic host microorganisms. However, a number of other genera, species and strains are commonly available 

20 and useful herein, such as S. pom be [Beach and Nurse, Nature 290 . 140 (1981)], Kluvveromvces lactis 
[Louvencourt et ai , J. Bacteriol. 737 ( 1 983)]; varrowia (EP 402,226); Pichia oastoris (EP 1 83,070), Trichoderma 
reesia (EP 244,234), Neurospora crassa [Case et ai, Proc. Natl. Acad. Sci. USA 76, 5259-5263 (1979)]; and 
Aspergillus hosts such as A. nidulaps [Ballance et ai. Biochem. Biophvs, Res. Commun. 1 12. 284-289 (1983): 
Tilburne/ a/., Qejos 2fi, 205-221 (1983); Yelton et ai. Proc. Natl. Acad. Sci. USA 81. 1470- 1474 (-1 984)] and 

25 A. niger [Kelly and Hvnes. EMBQ J. 4. 475-479 (1985^. 

Suitable host cells may also derive from multicellular organisms. Such host cells are capable of 
complex processing and glycosylation activities. In principle, any higher eukaryotic cell culture is workable, 
whether from vertebrate or invertebrate culture, although cells from mammals such as humans are preferred. 
Examples of invertebrate cells include plants and insect cells. Numerous baculoviral strains and variants and 

30 corresponding permissive insect host cells from hosts such as Spodoptera frugioerda (caterpillar), Aedes aegvpti 
(mosquito), Aedes albopictus (mosquito), Drosophila melangaster (fruitfly), and Bombvx mori host cells have 
been identified. See, e.g. Luckow et ai, Bio/Technology 6, 47-55 (1988); Miller et ai. in Genetic Engineering . 
Setiow, J.K. et ai. eds.. Vol. 8 (Plenum Publishing, 1986), pp. 277-279; and Maeda et ai. Nature 315 . 592-594 
(1985). A variety of such viral strains are publicly available, e.g. the L-l variant of Autoerapha californica NPV. 

35 and such viruses may be used as the virus herein according to the present invention, particularly for transfection 
of Spodoptera frueiperda cells. 

Plant cell cultures of cotton, corn, potato, soybean, petunia, tomato, and tobacco can be utilized as hosts. 
Typically, plant cells are transfected by incubation with certain strains of the bacterium Agrobacterium 
tumefaciens. which has been previously manipulated to contain the PTP HSC DNA. During incubation of the 
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plant cell culture with A. tumefaciens . the DNA encoding a PTP HSC is transferred to the plant cell host such 

that it is transfected, and will, under appropriate conditions, express the PTP HSC DNA. In addition, regulatory 

and signal sequences compatible with plant cells are available, such as the nopaline synthase promoter and 

polyadenylation signal sequences. Depicker et al. J. Mol. Apol. Gen. I. 561 (1982). In addition, DNA segments 

5 isolated from the upstream region of the T-DNA 780 gene are capable of activating or increasing transcription 

levels of plant-expressible genes in recombinant DNA-containing plant tissue. See EP 32 U 96 published 2 1 

June 1989. 

However, interest has been greatest in vertebrate ceils, and propagation of vertebrate cells in culture 
(tissue culture) is Qeigg well known. See Tissue Culture . Academic Press, Kruse and Patterson, editors (1973) 

1 0 Examples of useful mammalian host cell lines are monkey kidney CV 1 line transformed by SV40 (COS-7. ATCC 
CRL 1651); human embryonic kidney cell line [293 or 293 cells subcloned for growth in suspension culture. 
Graham et ai, J. Gen. Virol. 2& 59 ( 1 977)]; baby hamster kidney cells 9BHK. ATCC CCL 1 0); Chinese hamster 
ovary cells/-DHFR [CHO, Urlaub and Chasm, Proc. Natl. Acad. Sci. USA 77, 4216 (1980)]; mouse sertolli cells 
[TM4, Mather, Biol. Renrod. 22. 243-25 1 ( 1 980)]; monkey kidney cells (C V 1 ATCC CCL 70); African green 

15 monkey kidney cells (VERO-76, ATCC CRL- 1587); human cervical carcinoma cells (HELA, ATCC CCL 2): 
canine kidney cells (MDCK, ATCC CCL 34); buffalo rat liver cells (BRL 3 A, ATCC CRL 1442); human lung 
cells (W138, ATCC CCL75); human liver cells (Hep G2, HB 8065); mouse mammary tumor (MMT 060562. 
ATCC CCL51); TR1 cells [Mather et al. Annals N.Y. Acad. Sci. 383 . 44068 (! 982)]; MRC 5 cells; FS4 cells; 
and a human hepatoma cell line (Hep G2). Preferred host cells are human embryonic kidney 293 and Chinese 

20 hamster ovary cells. 

Particularly useful in the practice of this invention are expression vectors that provide for the transient 
expression in mammalian cells of DNA encoding a PTP HSC. In general, transient expression involves the use 
of an expression vector that is able to replicate efficiently in a host cell, such that the host cell accumulates many 
copies of the expression vector and, in turn, synthesizes high levels of a desired polypeptide encoded by the 

25 expression vector. Transient systems, comprising a suitable expression vector and a host cell, allow for the 
convenient positive identification of polypeptides encoded by clones DNAs, as well as for the rapid screening 
of such polypeptides for desired biological or physiological properties. Thus, transient expression systems are 
particularly useful in the invention for purposes of identifying analogs and variants of a PTP HSC. 
Other methods, vectors, and host cells suitable for adaptation to the synthesis of the PTP HSC polypeptides in 

30 recombinant vertebrate cell culture are described in Getting et a/.. Nature 293 . 620-625 (1981); Mantel et al, 
Nature 281 . 40-46 (1979); Levinson et al\ EP 1 17.060 and EP 1 17,058. Particularly useful plasmids for 
mammalian cell culture expression of the PTP HSC polypeptides are pRK5 (EP 307,247), or pSVI6B (PCT 
Publication No. WO 91/08291). 

Other cloning and expression vectors suitable for the expression of the PTP HSCs of the present 

35 invention in a variety of host cells are, for example, described in EP 457,758 published 27 November 1991. A 
large variety of expression vectors is now commercially available. An exemplary commercial yeast expression 
vector is pPIC.9 (Invitrogen), while an commercially available expression vector suitable for transformation of 
E. coli cells is PET! 5b (Novagen). 
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C. Culturing the Host Cells 

Prokaryotes cells used to produced the PTP HSCs of this invention are cultured in suitable media as 
describe generally in Sambrook et ai, supra . 

Mammalian cells can be cultured in a variety of media. Commercially available media such as Ham's 
5 F1 0 (Sigma), Minimal Essential Medium (MEM, Sigma), RPMI- 1640 (Sigma), and Dulbecco's Modified Eagle's 
Medium fDMEM, Sigma) are suitable for culturing the host cells. In addition, any of the media described in 
Ham and Wallace, Meth. Enzvmol. 44 (1979); Barnes and Sato, Anal. Biochem. 102 . 255 (1980), US 
4,767,704; 4,657,866; 4,927,762; or 4,560,655; WO 90/03430; WO 87/00195 or US Pat. Re. 30.985 may be 
used as culture media for the host cells. Any of these media may be supplemented as necessary with hormones 
10 and/or other growth factors (such as insulin, transferrin, or epidermal growth factor), salts (such as sodium 
chloride, calcium, magnesium, and phosphate), buffers (such as HEPES), nucleosides (such as adenosine and 
thymidine), antibiotics (such as Gentamycin™ drug) trace elements (defined as inorganic compounds usually 
present at final concentrations in the micromolar range), and glucose or an equivalent energy source. Any other 
necessary supplements may also be included at appropriate concentrations that would be known to those skilled 
15 in the art. The culture conditions, such as temperature, pH and the like, suitably are those previously used with 
the host cell selected for cloning or expression, as the case may be, and will be apparent to the ordinary artisan. 
The host cells referred to in this disclosure encompass cells in in vitro cell culture as well as cells that 
are within a host animal or plant. $ 

It is further envisioned that the PTP HSCs of this invention may be produced by homologous 
20 recombination, or with recombinant production methods utilizing control elements introduced into cells already 
containing DN A encoding the particular PTP HSC. 

D. Detecting Gene Amplification/Expression 

Gene amplification and/or expression may be measured in a sample directly, for example, by 
conventional Southern blotting, Northern blotting to quantitate the transcription of mRNA [Thomas, Proc. Natl 

25 Acad. Svit USA 22, 5201-5205 (1980)], dot blotting (DNA analysis), or in situ hybridization, using an 
appropriately labeled probe, based on the sequences provided herein. Various labels may be employed, most 
commonly radioisotopes, particularly 32 P. However, other techniques may also be employed, such as using 
biotin-modified nucleotides for introduction into a polynucleotide. The biotin then serves as a site for binding 
to avidin or antibodies, which may be labeled with a wide variety of labels, such as radionuclides, fluoresces, 

30 enzymes, or the like. Alternatively, antibodies may be employed that can recognize specific duplexes, including 
DNA duplexes, RNA duplexes, and DNA-RNA hybrid duplexes or DNA-protein duplexes. The antibodies in 
turn may be labeled and the assay may be carried out where the duplex is bound to the surface, so that upon the 
formation of duplex on the surface, the presence of antibody bound to the duplex can be detected. 

Gene expression, alternatively, may be measured by immunological methods, such as 

35 immunohistochemical staining of tissue sections and assay of cell culture or body fluids, to quantitate directly 
the expression of gene product. With immunohistochemical staining techniques, a cell sample is prepared, 
typically by dehydration and fixation, followed by reaction with labeled antibodies specific for the gene product 
coupled, where the labels are usually visually detectable, such as enzymatic labels, fluorescent labels. 
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luminescent labels, and the like. A particularly sensitive staining technique suitable for use in the present 

invention is described by Hse et al. Am. J. Clin. Pharm. 75. 734-738 (1980). 

Antibodies useful for immunohistochemical staining and/or assay of sample fluids may be either 
monoclonal or polyclonal, and may be prepared in any animal. Conveniently, the antibodies may be prepared 
5 against a native PTP HSC polypeptide, or against a synthetic peptide based on the DNA sequence provided 
herein as described further hereinbelow. 

E. Amino Acid Sequence Variants of a native PTP HSCs 

Amino acid sequence variants of native PTP HSCs are prepared by methods known in the art by 
introducing appropriate nucleotide changes into a PTP HSC DNA, or by in vitro synthesis of the desired 

10 polypeptide. There are two principal variables in the construction of amino acid sequence variants: the location 
of the mutation site and the nature of the mutation. With the exception of naturally-occurring alleles, which do 
not require the manipulation of the DNA sequence encoding the PTP HSC, the amino acid sequence variants of 
PTP HSCs are preferably constructed by mutating the DNA, either to arrive at an allele or an amino acid 
sequence variant that does not occur in nature. 

15 One group of the mutations will be created within the phosphatase (PTP) domain of the enzymes of 

the present invention. Non-conservative substitutions within this domain may result in PTP HSC variants which 
loose their ability to dephosphatase tyrosines and will, therefore, be useful as antagonists of native PTP HSCs. 
PTP HSC variants mutated to enhance their enzymatic activity will be useful, for example, as more effective 
inhibitors of progenitor/stem cell differentiation. 

20 Alternatively or in addition, amino acid alterations can be made at sites that differ in PTP HSC proteins 

from various species, or in highly conserved regions, depending on the goal to be achieved. Sites at such 
locations will typically be modified in series, e.g. by (1 ) substituting first with conservative choices and then with 
more radical selections depending upon the results achieved, (2) deleting the target residue or residues, or (3) 
inserting residues of the same or different class adjacent to the located site, or combinations of options 1 -3. One 

25 helpful technique is called "alanine scanning" (Cunningham and Wells. Science 244 . 1081-1085 [1989]). 

After identifying the desired mutation(s), the gene encoding a PTP HSC variant can, for example, be 
obtained by chemical synthesis as hereinabove described. More preferably, DNA encoding a PTP HSC amino 
acid sequence variant is prepared by site-directed mutagenesis of DNA that encodes an earlier prepared variant 
or a nonvariant version of the PTP HSC. Site-directed (site-specific) mutagenesis allows the production of PTP 

30 HSC variants through the use of specific oligonucleotide sequences that encode the DNA sequence of the desired 
mutation, as well as a sufficient number of adjacent nucleotides, to provide a primer sequence of sufficient size 
and sequence complexity to form a stable duplex on both sides of the deletion junction being traversed. 
Typically, a primer of about 20 to 25 nucleotides in length is preferred, with about 5 to 10 residues on both sides 
of the junction of the sequence being altered. In general, the techniques of site-specific mutagenesis are well 

35 known in the art, as exemplified by publications such as, Edelman et ai. y DNA 2, 183 (1983). As will be 
appreciated, the site-specific mutagenesis technique typically employs a phage vector that exists in both a single- 
stranded and double-stranded form. Typical vectors useful in site-directed mutagenesis include vectors such as 
the M 13 phage, for example, as disclosed by Messing et aL, Third Cleveland Symposium on Macromolecules 
and Recombinant DNA . A. Walton, ed., Elsevier, Amsterdam (1981). This and other phage vectors are 
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commercially available and their use is well known to those skilled in the art. A versatile and efficient procedure 

for the construction of oligodeoxyribonucleotide directed site-specific mutations in DNA fragments using M 1 3- 
derived vectors was published by Zoller, M.J. and Smith, M.. Nucleic Acids Res. 10. 6487-6500 [1982]). Also, 
plasmid vectors that contain a single-stranded phage origin of replication (Veira et al, Meth. Enzvmol. I 53 . 3 
5 [ 1 987]) may be employed to obtain single-stranded DNA. Alternatively, nucleotide substitutions are introduced 
by synthesizing the appropriate DNA fragment in vitro, and amplifying it by PCR procedures known in the art. 

The PCR technique may also be used in creating amino acid sequence variants of a PTP HSC, In a 
specific example of PCR mutagenesis, template plasmid DNA (I ug) is linearized by digestion with a restriction 
endonuclease that has a unique recognition site in the plasmid DNA outside of the region to be amplified. Of 

1 0 this material, 100 ng is added to a PCR mixture containing PCR buffer, which contains the four deoxy nucleotide d 
triphosphates and is included in the GeneAmp 1 ^ kits (obtained from Perkin-Elmer Cetus, Norwalk, CT and 
Emeryville, CA), and 25 pmole of each oligonucleotide primer, to a final volume of 50 ul. The reaction mixture 
is overlayered with 35 uJ mineral oil. The reaction is denatured for 5 minutes at 100°C, placed briefly on ice, 
and then 1 ul Thermus aquaticus (Tag) DNA polymerase (5 units/ I), purchased from Perkin-Elmer Cetus, 

15 Norwalk, CT and Emeryville, CA) is added below the mineral oil layer. The reaction mixture is then inserted 

into a DNA Thermal Cycler (purchased from Perkin-Elmer Cetus) programmed as follows: fri 
2 min. 55oC, 

30 sec. 72°C, then 19 cycles of the following: 
30 sec. 94oC, 

20 30 sec. 55°C, and <' 

30 sec. 72 oC. 

At the end of the program, the reaction vial is removed from the thermal cycler and the aqueous phase 
transferred to a new vial, extracted with phenol/chloroform (50:50 vol), and ethanol precipitated, and the DNA 
is recovered by standard procedures. This material is subsequently subjected to appropriate treatments for 

25 insertion into a vector. 

Another method for preparing variants, cassette mutagenesis, is based on the technique described by 
Wells et ai f Gene 21, 3 1 5 (1985)]. 

Additionally, the so-called phagemid display method may be useful in making amino acid sequence 
variants of native or variant PTP HSCs or their fragments. This method involves (a) constructing a replicable 

30 expression vector comprising a first gene encoding an receptor to be mutated, a second gene encoding at least 
a portion of a natural or wild-type phage coat protein wherein the first and second genes are heterologous, and 
a transcription regulatory element operably linked to the first and second genes, thereby forming a gene fusion 
encoding a fusion protein; (b) mutating the vector at one or more selected positions within the first gene thereby 
forming a family of related plasmids; (c) transforming suitable host cells with the plasmids; (d) infecting the 

35 transformed host cells with a helper phage having a gene encoding the phage coat protein; (e) culturing the 
transformed infected host cells under conditions suitable for forming recombinant phagemid panicles containing 
at least a portion of the plasmid and capable of transforming the host, the conditions adjusted so that no more 
than a minor amount of phagemid particles display more than one copy of the fusion protein on the surface of 
the particle; (0 contacting the phagemid particles with a suitable antigen so that at least a portion of the phagemid 
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particles bind to the antigen; and (g) separating the phagemid particles that bind from those that do not. Steps 

(d) through (g) can be repeated one or more times. Preferably in this method the plasmid is under tight control 

of the transcription regulatory element, and the culturing conditions are adjusted so that the amount or number 

of phagemid particles displaying more than one copy of the fusion protein on the surface of the panicle is less 

5 than about 1%. Also, preferably, the amount of phagemid particles displaying more than one copy of the fusion 

protein is less than 10% of the amount of phagemid particles displaying a single copy of the fusion protein. Most 

preferably, the amount is less than 20%. Typically in this method, the expression vector will further contain a 

secretory signal sequence fused to the DNA encoding each subunit of the polypeptide and the transcription 

regulatory element will be a promoter system. Preferred promoter systems are selected from lac Z. a rl , tac, T7 

10 polymerase, tryptophan, and alkaline phosphatase promoters and combinations thereof. Also, normally the 
method will employ a helper phage selected from M I3K07, M 1 3R408, M 1 3-VCS, and Phi X 1 74. The preferred 
helper phage is M13K07, and the preferred coat protein is the M 13 Phage gene 111 coat protein. The preferred 
host is E. coli, and protease-deficient strains of E. coli. 

Further details of the foregoing and similar mutagenesis techniques are found in general textbooks, such 

1 5 as, for example, Sambrook et aL, supra, and Current Pr otocols in Molecular Biology . Ausubel et al. eds., supra. 

Naturally-occurring amino acids are divided into groups based on common side chain properties: 

(1 ) hydrophobic: norleucine, met, ala, val. leu, ile; 

(2) neutral hydrophobic: cys, ser, thr; 

(3) acidic: asp, glu; 

20 (4) basic: asn, gin, his, lys, arg; 

(5) residues that influence chain orientation: gly, pro; and 

(6) aromatic: trp, tyr, phe. 

Conservative substitutions involve exchanging a member within one group for another member within 
the same group, whereas non-conservative substitutions will entail exchanging a member of one of these classes 
25 for another. 

Amino acid sequence deletions generally range from about 1 to 30 residues, more preferably about 1 
to 1 0 residues, and typically are contiguous. 

Amino acid insertions include amino- and/or carboxyl-terminal fusions ranging in length from one 
residue to polypeptides containing a hundred or more residues, as well as intrasequence insertions of single or 
30 multiple amino acid residues. Intrasequence insertions (i.e. insertions within the PTP HSC protein amino acid 
sequence) may range generally from about 1 to 10 residues, more preferably 1 to 5 residues, more preferably 1 
to 3 residues. Examples of terminal insertions include the PTP HSC polypeptides with an N-terminal methionyl 
residue, an artifact of its direct expression in bacterial recombinant cell culture, and fusion of a heterologous N- 
tenninal signal sequence to the N-terrninus of the PTP HSC molecule to facilitate the secretion of the mature PTP 
35 HSC from recombinant host cells. Such signal sequences will generally be obtained from, and thus homologous 
to, the intended host cell species. Suitable sequences include STII or Ipp for E. coli . alpha factor for yeast, and 
viral signals such as herpes gD for mammalian cells. 

Other insertional variants of the native PTP HSC molecules include the fusion of the N- or C-terminus 
of the TRAF molecule to immunogenic polypeptides, e.g. bacterial polypeptides such as beia-lactamase or an 
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enzyme encoded by the E. coli trp locus, or yeast protein, and C-terminal fusions with proteins having a long 

half-life such as immunoglobulin regions (preferably immunoglobulin constant regions), albumin, or ferritin, as 

described in WO 89/02922 published on 6 April 1989. 

Since it is often difficult to predict in advance the characteristics of a variant PTP HSC. it will be 

5 appreciated that some screening will be needed to select the optimum variant. 

F. Covalent Modifications of PTP HSC Polypeptides 

Covalent modifications of PTP HSCs are included within the scope herein. Such modifications are 
traditionally introduced by reacting targeted amino acid residues of the PTP HSC polypeptides with an organic 
derivatizing agent that is capable of reacting with selected sides or terminal residues, or by harnessing 

10 mechanisms of post- trans lational modifications that function in selected recombinant host cells. The resultant 
covalent derivatives are useful in programs directed at identifying residues important for biological activity, for 
immunoassays of the PTP HSC, or for the preparation of anti-PTP HSC antibodies for immunoaffinity 
purification of the recombinant. For example, complete inactivation of the biological activity of the protein after 
reaction with ninhydrin would suggest that at least one arginyl or lysyl residue is critical for its activity. 

1 5 whereafter the individual residues which were modified under the conditions selected are identified by isolation 
of a peptide fragment containing the modified amino acid residue. Such modifications are within the ordinary 
skill in the art and are performed without undue experimentation. 

Cysteinyl residues most commonly are reacted with a-haloacetates (and corresponding amines), such 
as chloroacetic acid or chloroacetamide, to give carboxymethyl or carboxyamidomethyl derivatives. Cysteinyl 

20 residues also are derivatized by reaction with bromotrifluoroacetone, a-brorno-p-(5-imidozoyl)propionic acid, 
chloroacetyl phosphate, N-aikylmaleimides, 3-nitro-2-pyridyl disulfide, methyl 2-pyridyl disulfide, p- 
chloromercuribenzoate, 2-chloromercuri-4-nitrophenol, or chloro-7-nitrobenzo-2-oxa-l ,3-diazole. 

Histidyl residues are derivatized by reaction with diethylpyrocarbonate at pH 5.5-7.0 because this agent 
is relatively specific for the histidyl side chain. Para-bromophenacyl bromide also is useful; therreaction is 

25 preferably performed in 0. 1M sodium cacodylate at pH 6.0. 

Lysinyl and amino terminal residues are reacted with succinic or other carboxylic acid anhydrides. 
Derivatization with these agents has the effect of reversing the charge of the lysinyl residues. Other suitable 
reagents for derivatizing oc-amino-containing residues include imidoesters such as methyl picolinimidate; 
pyridoxal phosphate; pyridoxal; chloroborohydride; trinitrobenzenesulfonic acid; O-methyiisourca; 2,4- 

30 pentanedione; and transaminase-catalyzed reaction with giyoxylate. 

Arginyl residues are modified by reaction with one or several conventional reagents, among them 
phenylglyoxal, 2,3-butanedione t 1,2-cyclohexanedione. and ninhydrin. Derivatization of arginine residues 
requires that the reaction be performed in alkaline conditions because of the high pK a of the guanidine functional 
group. Furthermore, these reagents may react with the groups of lysine as well as the arginine epsilon-amino 

35 group. 

The specific modification of tyrosyl residues may be made, with particular interest in introducing 
spectral labels into tyrosyl residues by reaction with aromatic diazonium compounds or tetranitrometharie. Most 
commonly, N-acetylimidizole and tetranitromethane are used to form O-acetyl tyrosyl species and 3-nitro 
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derivatives, respectively. Tyrosyl residues are iodinated using *~^l or ^'l to prepare labeled proteins for use 

in radioimmunoassay. 

Carboxyl side groups (aspartyl or glutamyl) are selectively modified by reaction with carbodiimides (R'- 
N=C=N-R') such as l-cyclohexyl-3-(2-morpholinyl-4-ethyl) carbodiimide or 1 -ethyl-3-(4-azonia-4.4- 
5 dimethylpentyl) carbodiimide. Furthermore, aspartyl and glutamyl residues are converted to asparaginyl and 
glutaminyl residues by reaction with ammonium ions. 

Glutaminyl and asparaginyl residues are frequently deamidated to the corresponding glutamyl and 
aspartyl residues. Alternatively, these residues are deamidated under mildly acidic conditions. Either form of 
these residues falls within the scope of this invention. 

10 Other modifications include hydroxy lation of proline and lysine, phosphorylation of hydroxy 1 groups 

of seryl, threonyl or tyrosyl residues, methylation of the a -amino groups of lysine, arginine, and histidine side 
chains (T.E. Creighton, Proteins: Structure and Molecular Properties . W.H. Freeman & Co., San Francisco, pp. 
79-86 [1983]), acetylation of the N-terminal amine, and amidation of any C -terminal carboxyl group. The 
molecules may further be covalently linked to nonproteinaceous polymers, e.g. polyethylene glycol. 

15 polypropylene glycol or polyoxyalkylenes, in the manner set forth in U.S. S.N. 07/275,296 or U.S. patents 
4,640,835; 4,496,689; 4,301,144; 4,670,417; 4,791,192 or 4,179,337. 

Derivatization with Afunctional agents is useful for preparing intramolecular aggregates of the PTP 
HSCs with polypeptides as well as for cross-linking the PTP HSC polypeptide to a water insoluble support matrix 
or surface for use in assays or affinity purification. In addition, a study of interchain cross-links will provide 

20 direct information on conformational structure. Commonly used cross-linking agents include 1,1- 
bis(diazoacetyl)-2-phenylethane, glutaraldehyde, N-hydroxysuccinimide esters, homobifunctional imidoesters, 
and bifunctional maleimides. Derivatizing agents such as methyl-3-[(p-azidophenyl)dithio]propioimidate yield 
photoactivatable intermediates which are capable of forming cross-links in the presence of light. Alternatively, 
reactive water insoluble matrices such as cyanogen bromide activated carbohydrates and the systems reactive 

25 substrates described in U.S. Patent Nos. 3,959,642; 3,969,287; 3,691,016; 4,195,128; 4,247,642; 4,229,537; 
4,055,635; and 4,330,440 are employed for protein immobilization and cross- linking. 

Certain post-translational modifications are the result of the action of recombinant host cells on the 
expressed polypeptide. Glutaminyl and aspariginyl residues are frequently post-translational ly deamidated to 
the corresponding glutamyl and aspartyl residues. Alternatively, these residues are deamidated under mildly 

30 acidic conditions. Either form of these residues falls within the scope of this invention. 

Other post-translational modifications include hydroxyiation of proline and lysine, phosphorylation of 
hydroxy I groups of seryl, threonyl or tyrosyl residues, methylation of the a -amino groups of lysine, arginine, and 
histidine side chains [T.E. Creighton, Proteins: Structure and Molecular Properties . W.H. Freeman & Co.. San 
Francisco, pp. 79-86 (1983)]. 

3 5 Other derivatives comprise the novel peptides of this invention covalently bonded to a nonproteinaceous 

polymer. The nonproteinaceous polymer ordinarily is a hydrophilic synthetic polymer, i.e. a polymer not 
otherwise found in nature. However, polymers which exist in nature and are produced by recombinant or in vitro 
methods are useful, as are polymers which are isolated from nature. Hydrophilic polyvinyl polymers fall within 
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the scope of this invention, e.g. polyvinylalcohol and polyvinylpyrrolidone. Particularly useful are 

polyvinylalkyiene ethers such a polyethylene glycol, polypropylene glycol. 

The PTP HSC polypeptides may be linked to various nonproteinaceous polymers, such as polyethylene 
glycol, polypropylene glycol or polyoxyaikylenes, in the manner set forth in U.S. Patent Nos. 4,640,835: 
5 4,496,689; 4,30 1 , 1 44; 4,670.4 1 7; 4,79 1,192 or 4, i 79,337. 

The PTP HSCs may be entrapped in microcapsules prepared, for example, by coacervation techniques 
or by interfacial polymerization, in colloidal drug delivery systems (e.g. liposomes, albumin microspheres, 
microemulsions, nano-particles and nanocapsules), or in macroemulsions. Such techniques are disclosed in 
Remington's Pharmaceutical Sciences . 16th Edition, Osol, A., Ed. (1980). 
10 G. Anti-PTP HSC antibody preparation 

(t) Polyclonal antibodies 

Polyclonal antibodies to a PTP HSC molecule generally are raised in animals by multiple subcutaneous 
(sc) or intraperitoneal (ip) injections of the PTP HSC and an adjuvant. It may be useful to conjugate the PTP 
HSC or a fragment containing the target amino acid sequence to a protein that is immunogenic in the species to 

15 be immunized, e.g. keyhole limpet hemocyanin, serum albumin, bovine thyroglobulin, or soybean trypsin 
inhibitor using a bifunctional or derivatizing agent, for example maleimidobenzoyl sulfosuccinimide ester 
(conjugation through cysteine residues), N-hydroxysuccinimide (through lysine residues), glytaraldehyde. 
succinic anhydride, SOCh, or R 1 N=C=NR, where R and R l are different alkyl groups. c 

Animals are immunized against the immunogenic conjugates or derivatives by combining I'mg or 1 ug 

20 of conjugate (for rabbits or mice, respectively) with 3 volumes of Freud's complete adjuvant and injecting the 
solution intradermally at multiple sites. One month later the animals are boosted with 1/5 to 1/10 the original 
amount of conjugate in Freud's complete adjuvant by subcutaneous injection at multiple sites. 7 to 14 days later 
the animals are bled and the serum is assayed for anti-PTP HSC antibody titer. Animals are boosted until the 
titer plateaus. Preferably, the animal boosted with the conjugate of the same PTP HSC. but conjugated to a 

25 different protein and/or through a different cross-linking reagent. Conjugates also can be made in recombinant 
cell culture as protein fusions. Also, aggregating agents such as alum are used to enhance the immune response, 
(ii) Monoclonal antibodies 

Monoclonal antibodies are obtained from a population of substantially homogeneous antibodies, i.e., 

the individual antibodies comprising the population are identical except for possible naturally-occurring 
30 mutations that may be present in minor amounts. Thus, the modifier "monoclonal" indicates the character of the 

antibody as not being a mixture of discrete antibodies. 

For example, the anti-PTP HSC monoclonal antibodies of the invention may be made using the 

hybridoma method first described by Kohler & Mtlstein, Nature 256:495 (1975). or may be made by recombinant 

DNA methods [Cabilly, e/a/., U.S. Pat. No. 4,816,567]. 
35 In the hybridoma method, a mouse or other appropriate host animal, such as hamster is immunized as 

hereinabove described to elicit lymphocytes that produce or are capable of producing antibodies that will 

specifically bind to the protein used for immunization. Alternatively, lymphocytes may be immunized in vitro . 

Lymphocytes then are fused with myeloma cells using a suitable fusing agent, such as polyethylene glvcol, to 
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form a hybridoma cell [Goding, Monoclonal Antibodies: Principles and Practice , pp.59- 103 (Academic Press, 

1986)]. 

The hybridoma cells thus prepared are seeded and grown in a suitable culture medium that preferably 
contains one or more substances that inhibit the growth or survival of the unfused, parental myeloma cells. For 
5 example, if the parental myeloma cells lack the enzyme hypoxanthine guanine phosphoribosyl transferase 
(HGPRT or HPRT), the culture medium for the hybridomas typically will include hypoxanthine, aminopterin. 
and thymidine (HAT medium), which substances prevent the growth of HGPRT-deficient cells. 

Preferred myeloma cells are those that fuse efficiently, support stable high level expression of antibody 
by the selected antibody-producing cells, and are sensitive to a medium such as HAT medium. Among these, 

10 preferred myeloma cell lines are murine myeloma lines, such as those derived from MOPC-21 and MPC-1 1 
mouse tumors available from the Salk Institute Cell Distribution Center, San Diego, California USA, and SP-2 
cells available from the American Type Culture Collection, Rockville, Maryland USA. Human myeloma and 
mouse-human heteromyeloma cell lines also have been described for the production of human monoclonal 
antibodies [Kozbor, J. Immunol. 133 :300 1 ( 1984); Brodeur, et al.. Monoclonal Antibody Production Techniques 

15 and Applications , pp.5 1-63 (Marcel Dekker, Inc., New York, 1987)]. 

Culture medium in which hybridoma cells are growing is assayed for production of monoclonal 
antibodies directed against PTP HSC. Preferably, the binding specificity of monoclonal antibodies produced 
by hybridoma cells is determined by immunoprecipitation or by an jn vitro binding assay, such as 
radioimmunoassay (R1A) or enzyme-linked immunoabsorbent assay (EL1SA). The binding affinity of the 

20 monoclonal antibody can, for example, be determined by the Scatchard analysis of Munson & Pollard, Anal. 
Biochem. JLQ7:220 (1980). 

After hybridoma cells are identified that produce antibodies of the desired specificity, affinity, and/or 
activity, the clones may be subcloned by limiting dilution procedures and grown by standard methods. Goding, 
Monoclonal Antibodies: Principles and Practice , pp.59- 104 (Academic Press, 1986). Suitable culture media for 

25 this purpose include, for example, Dulbecco's Modified Eagle's Medium or RPM1-1640 medium. In addition, 
the hybridoma cells may be grown in vivo as ascites tumors in an animal. 

The monoclonal antibodies secreted by the subclones are suitably separated from the culture medium, 
ascites fluid, or serum by conventional immunoglobulin purification procedures such as, for example, protein 
A-Sepharose, hydroxyiapatite chromatography, gel electrophoresis, dialysis, or affinity chromatography. 

30 DNA encoding the monoclonal antibodies of the invention is readily isolated and sequenced using 

conventional procedures (e.g., by using oligonucleotide probes that are capable of binding specifically to genes 
encoding the heavy and light chains of murine antibodies). The hybridoma cells of the invention serve as a 
preferred source of such DNA. Once isolated, the DNA may be placed into expression vectors, which are then 
transfected into host cells such as simian COS cells, Chinese hamster ovary (CHO) cells, or myeloma cells that 

35 do not otherwise produce immunoglobulin protein, to obtain the synthesis of monoclonal antibodies in the 
recombinant host cells. The DNA also may be modified, for example, by substituting the coding sequence for 
human heavy and light chain constant domains in place of the homologous murine sequences, Morrison, et al.. 
Proc. Nat. Acad. Sci. 81 . 6851 (1984), or by covalently joining to the immunoglobulin coding sequence all or 
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part of the coding sequence for a non-immunogiobuiin polypeptide. In that manner, "chimeric" or "hybrid" 

antibodies are prepared that have the binding specificity of an anti-TRAF monoclonal antibody herein. 

Typically such non-immunoglobulin polypeptides are substituted for the constant domains of an 
antibody of the invention, or they are substituted for the variable domains of one antigen-combining site of an 
5 antibody of the invention to create a chimeric bivalent antibody comprising one antigen-combining site having 
specificity for a PTP HSC and another antigen-combining site having specificity for a different antigen. 

Chimeric or hybrid antibodies also may be prepared in vitro using known methods in synthetic protein 
chemistry, including those involving crosslinking agents. For example, immunotoxins may be constructed using 
a disulfide exchange reaction or by forming a thioether bond. Examples of suitable reagents for this purpose 
10 include iminothiolate and methyl-4-mercaptobutyrimidate. 

For diagnostic applications, the antibodies of the invention typically will be labeled with a detectable 
moiety. The detectable moiety can be any one which is capable of producing, either directly or indirectly, a 
detectable signal. For example, the detectable moiety may be a radioisotope, such as ^H, l4 C, ^P, ** 5 S, or *~ 5 1, , 
a fluorescent or chemi luminescent compound, such as fluorescein isothiocyanate, rhodamine, or luciferin; biotin; 
15 radioactive isotopic labels, such as> e.g., *^l, ^P, * 4 C, or ^H, or an enzyme, such as alkaline phosphatase, 
beta-galactosidase or horseradish peroxidase. 

Any method known in the art for separately conjugating the antibody to the detectable moiety may be 
employed, including those methods described by Hunter, et ai. t Nature 144 :945 (1962); David, et ai, 
Biochemistry 13:1014(1974^; Pain, et ai. J. Immunol. Meth. 40:219 f 198H: and Nygren. J. Histochem. and 
20 Cvtochem. 2fl:407 ( 1 982). 

The antibodies of the present invention may be employed in any known assay method, such as 
competitive binding assays, direct and indirect sandwich assays, and immunoprecipitation assays. Zola. 
Monoclonal Antibodies: A Manual of Techniques , pp. 1 47- 158 (CRC Press, Inc., 1987). 

Competitive binding assays rely on the ability of a labeled standard (which may be a -PTP HSC 
25 polypeptide or an immunologically reactive portion thereof) to compete with the test sample analyte (PTP HSC) 
for binding with a limited amount of antibody. The amount of PTP HSC in the test sample is inversely 
proportional to the amount of standard that becomes bound to the antibodies. To facilitate determining the 
amount of standard that becomes bound, the antibodies generally are insolubilized before or after the 
competition, so that the standard and analyte that are bound to the antibodies may conveniently be separated from 
30 the standard and analyte which remain unbound. 

Sandwich assays involve the use of two antibodies, each capable of binding to a different immunogenic 
portion, or epitope, of the protein to be detected. In a sandwich assay, the test sample analyte is bound by a first 
antibody which is immobilized on a solid support, and thereafter a second antibody binds to the analyte. thus 
forming an insoluble three part complex. David & Greene, U.S. Pat No. 4,376, 1 1 0. The second antibody may 
35 itself be labeled with a detectable moiety (direct sandwich assays) or may be measured using an anti- 
immunoglobulin antibody that is labeled with a detectable moiety (indirect sandwich assay). For example, one 
type of sandwich assay is an ELISA assay, in which case the detectable moiety is an enzyme. 
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(iii) 



Humanized antibodies 



Methods for humanizing non-human antibodies are well known in the art. Generally, a humanized 
antibody has one or more amino acid residues introduced into it from a source which is non-human. These non- 
human amino acid residues are often referred to as "import" residues, which are typically taken from an "import" 
5 variable domain. Humanization can be essentially performed following the method of Winter and co-workers 
[Jones et at.. Nature 321 . 522-525 (1986); Riechmann et a/.. Nature 332 . 323-327 (1988); Verhoeyen et al. % 
Science 239 . 1534-1536 (1988)], by substituting rodent CDRs or CDR sequences for the corresponding 
sequences of a human antibody. Accordingly, such "humanized" antibodies are chimeric antibodies (Cabilly, 
supra ), wherein substantially less than an intact human variable domain has been substituted by the corresponding 
10 sequence from a non-human species. In practice, humanized antibodies are typically human antibodies in which 
some CDR residues and possibly some FR residues are substituted by residues from analogous sites in rodent 
antibodies. 

It is important that antibodies be humanized with retention of high affinity for the antigen and other 
favorable biological properties. To achieve this goal, according to a preferred method, humanized antibodies 

1 5 are prepared by a process of analysis of the parental sequences and various conceptual humanized products using 
three dimensional models of the parental and humanized sequences. Three dimensional immunoglobulin models 
are commonly available and are familiar to those skilled in the art. Computer programs are available which 
illustrate and display probable three-dimensional conformational structures of selected candidate immunoglobulin 
sequences. Inspection of these displays permits analysis of the likely role of the residues in the functioning of 

20 the candidate immunoglobulin sequence, i.e. the analysis of residues that influence the ability of the candidate 
immunoglobulin to bind its antigen. In this way, FR residues can be selected and combined from the consensus 
and import sequence so that the desired antibody characteristic, such as increased affinity for the target 
antigen(s), is achieved. In general, the CDR residues are directly and most substantially involved in influencing 
antigen binding. For further details see U.S. application Serial No. 07/934,373 filed 21 August 1992. which is 

25 a continuation-in-part of application Serial No. 07/715,272 filed 14 June 1991. 

Alternatively, it is now possible to produce transgenic animals (e.g. mice) that are capable, upon 
immunization, of producing a full repertoire of human antibodies in the absence of endogenous immunoglobulin 
production. For example, it has been described that the homozygous deletion of the antibody heavy chain joining 
region (J H ) gene in chimeric and germ-line mutant mice results in complete inhibition of endogenous antibody 

30 production. Transfer of the human germ-line immunoglobulin gene array in such germ-line mutant mice will 
result in the production of human antibodies upon antigen challenge. See, e.g. Jakobovits et al. y Proc. Natl. 



35 specificities for at least two different antigens. In the present case, one of the binding specificities is for a PTP 
HSC, the other one is for any other antigen, for example an antigen expressed on the surface of a leukemia cell, 
if the antibody is an antagonist of a native PTP HSC and is used to induce differentiation of undifferentiated 
lekemia cells. If an agonist antibody specifically binding to a native PTP HSC is used to expand stem cells with 
growth factors, as hereinafter described, the second specificity could be provided by a stem cell growth factor. 




i. USA 9Q, 255 1-255 (1 993); Jakobovits et ai , Nature 362 . 255-258 ( 1 993). 
(iv) Bispecific antibodies 

Bispecific antibodies are monoclonal, preferably human or humanized, antibodies that have binding 
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Such constructs can also be referred to as bispecific immunoadhesins. Methods for making bispecific antibodies 

(and bispecific immunoadhesins) are known in the art. 

Traditionally, the recombinant production of bispecific antibodies is based on the coexprcssion of two 

immunoglobulin heavy chain-light chain pairs, where the two heavy chains have different specificities (Millstein 

5 and Cuello, Nature 305 . 537-539 ( 1 983)). Because of the random assortment of immunoglobulin heavy and light 

chains, these hybridomas (quadromas) produce a potential mixture of 10 different antibody molecules, of which 

only one has the correct bispecific structure. The purification of the correct molecule, which is usually done by 

affinity chromatography steps, is rather cumbersome, and the product yields are low. Similar procedures are 

disclosed in PCT application publication No. WO 93/08829 (published 13 May 1993). and in Traunecker et ai. 

10 EMBO 10, 3655-3659(1991). 

According to a different and more preferred approach, antibody variable domains with the desired 
binding specificities (antibody-antigen combining sites) are fused to immunoglobulin constant domain sequences. 
The fusion preferably is with an immunoglobulin heavy chain constant domain, comprising at least part of the 
hinge, and second and third constant regions of an immunoglobulin heavy chain (CH2 and CH3). It is preferred 

1 5 to have the first heavy chain constant region (CH 1 ) containing the site necessary for light chain binding, present 
in at least one of the fusions. DNAs encoding the immunoglobulin heavy chain fusions and, if desired, the 
immunoglobulin light chain, are inserted into separate expression vectors, and are cotransfected into a suitable 
host organism. This provides for great flexibility in adjusting the mutual proportions of the three-polypeptide 
fragments in embodiments when unequal ratios of the three polypeptide chains used in the construction provide 

20 the optimum yields. It is, however, possible to insert the coding sequences for two or all three polypeptide chains 
in one expression vector when the expression of at least two polypeptide chains in equal ratios results in high 
yields or when the ratios are of no particular significance. In a preferred embodiment of this approach, the 
bispecific antibodies are composed of a hybrid immunoglobulin heavy chain with a first binding specificity in 
one arm, and a hybrid immunoglobulin heavy chain-light chain pair (providing a second binding specificity) in 

25 the other arm. It was found that this asymmetric structure facilitates the separation of the desired bispecific 
compound from unwanted immunoglobulin chain combinations, as the presence of an immunoglobulin light 
chain in only one half of the bispecific molecule provides for a facile way of separation. This approach is 
disclosed in copending application Serial No. 07/931,81 1 filed 17 August 1992. 

For further details of generating bispecific antibodies see. for example, Suresh et ai, Methods in 

30 Enzvmologv 1 2 1 . 2 1 0 ( \ 986V 

(v) Heteroconjugate antibodies 

Heteroconjugate antibodies are also within the scope of the present invention. Heteroconjugate 
antibodies are composed of two covalently joined antibodies. Such antibodies have, for example, been proposed 
to target immune system cells to unwanted cells (U.S. Patent No. 4,676,980), and for treatment of HIV infection 
35 (PCT application publication Nos. WO 91/00360 and WO 92/200373; EP 03089). Heteroconjugate antibodies 
may be made using any convenient cross-linking methods. Suitable cross-linking agents are well known in the 
art, and are disclosed in U.S. Patent No. 4,676,980, along with a number of cross-linking techniques. 
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H. Peptide and non-peptide analogs of polypeptide PTP HSCs 

Peptide analogs of the PTP HSC polypeptides of the present invention are modelled based upon the 
three-dimensional structure of the native polypeptides. Peptides may be synthesized by well known techniques 
such as the solid-phase synthetic techniques initially described in Merrifield, J. Am. Chem. Soc. J_5. 2 149-2 1 54 
5 (1963). Other peptide synthesis techniques are, for examples, described in Bodanszky et ai. Peptide Synthesis, 
John Wiley & Sons, 2nd Ed., 1976, as well as in other reference books readily available for those skilled in the 
art. A summary of peptide synthesis techniques may be found in Stuart and Young, Solid Phase Peptide 
Synthelia, Pierce Chemical Company, Rockford, IL ( 1 984). Peptides may also be prepared by recombinant DN A 
technology, using a DNA sequence encoding the desired peptide. 
10 In addition to peptide analogs, the present invention also contemplates non-peptide (e.g. organic) 

compounds which display substantially the same surface as the peptide analogs of the present invention, and 
therefore interact with other molecules in a similar fashion. 

I. Use of the PTP HSCs 

The PTP HSCs of the present invention are useful for a variety of purposes. For example, native PTP 
15 HSCs are useful for the identification and isolation of a PTP HSC analog in another mammalian species. Native 
PTP HSCs and their functional equivalents are also useful in screening assays designed to identify agonist of 
antagonist of native PTP HSCs. Such assays may take the form of any conventional cell-type or biochemical 
binding assay, and can be performed in a variety of assay formats well known for those skilled in the art. As 
example is the so called "two-hybrid" assay format using the Matchmaker Two-Hybrid System (Clontech) 
20 according to the manufacturer's instructions. 

The PTP HSCs of the present invention as well as their agonists can additionally be used for the 
maintenance of stem/progenitor cells in cell culture. Agonists which inhibit differentiation but allow for 
hematopoietic stem cell growth are particularly useful for this purpose, since their use results in an amplification 
of the stem cells without differentiation (self-renewal). This process might be useful, as an example, for the 
25 expansion of hematopoietic stem cells prior to autologous or heterologous bone marrow transplantation. The 
same approach can be used in vivo for the expansion of stem cells with growth factors, in the absence of 
diferentiation. 

It is believed that the native PTP HSCs of the present invention may be expressed in leukemic cells. 
Accordingly, antagonist of the PTP HSCs of the present invention may be used for the induction of 
30 differentiation of undifferentiated leukemia cells. This might allow for aggressive undifferentiated leukemia cells 
to become differentiated, which, in turn, facilitates their treatment, 

PTP HSC antagonists may also be used to induce differentiation of hematopoietic stem cells. As 
inhibition of the native PTP HSC enzyme might induce progenitor cells to differentiate, an antagonist of PTP 
HSC might act as a pan-inducer of myeloid, erythroid and lymphoid production. This use of PTP HSC 
35 antagonists may obviate or decrease the need for the use of stem cell growth factors. 

Further details of the invention are illsutrated in the following non-limiting examples. 
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Example 1 

Identification and cloning of murine PTP HSC 
A. Materials and Methods 

Isolation of embryonic lin'°CD34 n 'Sca n * hematopoietic stem cells. Yolk sacs or embryos were dissected from 
5 timed pregnant females at day 10.5. Fetal livers were isolated from day 13.5-14 embryos. Yolk sac and 

o 

embryonic tissues were dissociated with 1% collagenase in RPMI medium at 37 C for 1 5 minutes. Cells were 
further dissociated by two passages through a 16 gauge needle. Fetal liver was only dissociated by passage 
through a 16 guage needle. Adherent cells were attached to plastic by overnight incubation, after which the non 
adherent hematopoietic cells were incubated with a lineage cocktail of antibodies (1 ug each of TER 1 19, Gr-1. 

10 Ly-1, transferrin receptor and B220) for 1 hr on ice. Cells were washed, and the lineage positive cells were 
depleted using magnetic beads and a Miltenyi column. Lineage negative ceils were pelleted, resuspended in 2% 
FCS, PBS and incubated with rabbit anti-murine CD34 antibody (Baumhueter et ai. Science 262 . 436-38 
[1993]) on ice for 1 hr. Cells were washed three times in 2% FCS, PBS, resuspended in the same buffer and 
incubated with donkey, anti-rabbit FITC conjugated antibody and, in some cases, PE conjugated anti Sea 

1 5 antibody for 1 hr on ice. The cells were washed five times with 2% FCS, PBS, and than isolated by cell sorting 

on an ELITE cell sorter. ^ 
PCR analysis of mRNA isolated from lin lo CD34 ni Sca hi hematopoietic stem cells. Messenger RN A **; 
was isolated from the Lin Lo CD34 m Sca ni fraction of fetal yolk-sac hematopoietic cells (Micro-FastTrack. 
InVitrogene). Poly A+ RNA was reverse transcribed with random hexamers (Promega) and Moloney murine 

20 Leukemia virus revere transcriptase (Superscript II, GIBCO BRL). 1/4 of this cDNA was amplified by PCR 

using degenerate mixed oligonucleotides primers. Sense and antisense primers corresponding to the concensus * 
PTP amino acid sequences H / D FWRM ] / V W (5 , -A C / T TT C / T TGG A / c GlATG A / G TITGG-3') (SEQ. ID. NO: 
14, where the degenerate positions are designated by "N") and WPD F /' H GVP (5'- 
GGlAC G / A T / A G / A G / A TCIGGCCA-3') (SEQ. ID. NO: 1 5, wherein the degenerate positions are designated by 

25 "N") respectively were used. PCR were carry out in IX Taq DNA polymerase buffer (GIBCO BRL) plus 0.2 t 
mM of each dNTP, 10% DMSO and 5 units Taq polymerase (GIBCO BRL) for 25 cycles of 94° C for 1 minute, 

o o 

55 C for 1 rnin and 72 C for 1 minute. The PCR products were treated with Klenow enzyme (New England 
Biolabs) at 30 C for 30 minutes, cloned into Smal site of pRK-5 (EP 307,247, published March 15. 1989) 
plasmid, and subsequently sequenced (Sequenase, USB). 

30 cDNA and genomic cloning. Adapter- linked double strain cDN A was prepared from A+ RNA of day- 

10 murine embryos (Marathon-ready cDNA synthesize kit, Clontech) using either random hexamer or oligo dT 
primer. Full-length cDNA was isolated by 5' or 3' rapid amplification of cDNA ends (RACE) of the marathon- 
ready cDNAs. Genomic clones encoding the PTP HSC gene were isolated using standard techniques. The plaque 
purified lambda phage DNA was digested with Not 1, and the insert fragment was directly cloned without 

35 purification into Not 1 digested Bluescript. Exons were mapped using a combination of restriction digestion and 
southern blotting as well as DNA sequencing using custom primers. 

Bacterial expression of the PTP. cDNA sequences encode amino acid 8 to 323 containing the 
phosphatase domain were obtained by PCR using sense oligomer 5'- 
CACGGTCGACGGTGAGGAGCTTCTTTGAGCAGCTGGAGG-3 1 (SEQ. ID. NO: 3), and antisense oligomer 
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5 T -GTTGCGGCCGCGATTGGAGCGCAGTTCTCCTTGAGGTTCTGG-3 , (SEQ. ID. NO: 4). The PCR 

fragment was treated with Sail and Notl restriction enzyme and cloned into Sail and Notl digested pGEX-4T-l 
plasmid (Pharmacia). Fusion protein was affinity purified using a glutathione sepharose column (Pharmacia). 
Tyrosine phosphatase assays on the GST-fusion protein were carried out following the manufacture's procedure 
5 using two different tyrosine phosphorylated peptides from a tyrosine phosphatase assay kit (Boehringer 
Mannheim). 

Quantitative PCR analysis of RNA isolated from hematopoietic cells. cDNA was made from * 
RNA by reverse transcription (RT) with random hexamer. PCR was then used to amplified quantitatively PTP 
HSC cDNA and, as an internal standard, triosephosphate isomerase (TPI) cDNA. For each PCR, 6 ul of the 20 
10 ul RT reaction was brought to 50 ul so as to contain 0.3 mM of dNTPs, 4uCi of 32 P dATP (3,000Ci/mmol, 
Amersham), 100 pmol of each of the four primers, and 5 units of Taq DNA polymerase (GIBCO BRL). 

o 

Seventeen PCR cycles of 94 C for 50 seconds, 55° C for 50 seconds, and 70° C for 70 seconds. One-tenth of 
each PCR samples was electrophoresed in a 6% polyacrylamide gel, and the PCR products were quantitate by 
phosphorimaging (Fuji). Conditions for accurate quantitation of either PTP HSC or TPI were assessed in 

1 5 experiments that used serial dilutions of a standard preparation of A + RNA from 32D cells to determine for each 
primer pair the times of primer annealing and primer extension and the cycles that provided for a linear 
correlation between the amount of template RNA and the PCR product. Under the PCR conditions ultimately 
chosen, certain amount of sample RNA was analyzed simultaneously with serial dilutions of the standard RNA 
and a reverse transcriptase minus control. 

20 Northern blot analysts of tissues and cell lines. A Sall-NotI 1 .3 kb PTP HSC cDNA fragment was 

used to probe murine multi-tissue northern blot (Clontech). The same northern blot was used with various other 
probes, all of which demonstrated detectable, undegraded transcripts. 
PCR primer pairs 

5' RACE primers: antisense primer 5'-CCTGGAGGGTCCTGAGAGTGATGTCTGCATTCAGTG-3* (SEQ. ID. 
25 NO: 5), 5 , -CCTCTTGGAGCAGGGAAAGGATGACTCTTGTCTC-3' (SEQ. ID. NO: 6), 5*- 
CAGCTGCTCCAAGAAGCTCCTCACCAAGTC-3* (SEQ. ID, NO: 7). Sense primer: API and AP2 
(Clontech). 

3RACE primers: sense primer 5'-GGTAGAGGTGGGCAGGGTGAAGTGTTCTCGC-3' (SEQ. ID. NO: 8), 5 - 
CACTGAATGCAGACATCACTCTCAGGACCCTCCAGG-3' (SEQ, ID. NO: 9), 5'- 
30 GAGACAAGAGTCATCCTTTCCCTGCTCCAAGAGG-3* (SEQ. ID. NO: 10). Antisense primer: API and 
AP2 (Clontech). 

Quantitative RT-PCR primers: PTP HSC sense primer 5- 

CACTGAATGCAGACATCACTCTCAGGACCCTCCAGG-3* (SEQ. ID. NO: 9), antisense primer 5- 
GAATGGTAACCTGGAGGGTCCTGAG-3' (SEQ. ID. NO: II). TPI sense primer 5- 
35 GAGAAGGTCGTGTTCG AG (SEQ. ID. NO: 1 2), antisense primer 5'-GTGTACTTCCTGTGCCTG-3' (SEQ. 
ID. NO: 13). 

B. cDNA cloning of PTPs from Hematopoietic Stem Cells 

In order to analyze PTPs potentially involved with the maintenance of the hematopoietic stem cell, we 
isolated a highly purified population of these cells from either the murine 10.5 day yolk sac or embryo. 
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Previously, we showed that both progenitor activity as well as stromal cell repopulating activity were found in 

theCD34 m fraction of these embryonic ceils [3] (C. Fennie and L. Lasky-unpublished observations). In addition, 

others have shown that the murine CD34 m population isolated from bone marrow (Krause et aL, Blood 84(3), 

691-701 [1994]), or fetal liver (Ziegler et aL, Blood 84, 2422-2450 [1994]) contains stem cells capable of 

5 reconstituting lethally irradiated animals. In order to isolate a more highly purified fraction of these progenitor 

cells, we included a lineage depletion step as well as a positive selection step with the Sea antibody (Uchida et 

aL, Blood 83H2). 3758-3779 [1994]), in addition to the CD34 antibody. These morphologically primitive 

hematopoietic cells show a higher degree of stromal cell repopulating ability as well as cobblestone formation 

as compared to the previously described CD34 m progenitor cells, and we are currently investigating their in vivo 

10 repopulating activity (C. Fennie and L. Lasky-unpublished observations). Previous investigators have shown that 

the lin^Sca m fraction of bone marrow hematopoietic cells has a high level of repopulating activity (Sprangrude 

et aL, Science 241 . 58-62 [1988]). Thus, it is likely that the Iin lo CD34 m Sca hl cells isolated from the early 
embryo contain self renewing hematopoietic stem cells (Uchida et aL, supra; Krause et aL. supra; Ziegler et aL. 
supra. 

15 Consensus PCR using primers derived from two highly conserved regions of the PTP phosphatase ^ 

domain resulted in the cloning and sequencing of ~ 70 PCR fragments. As shown in Table 1 , a diversity of known 
receptor and non-receptor PTPs were detected in this fraction of these progenitor cells, and many of these PTPs 
have not previously been described in the hematopoietic stem cell compartment. Two novel PTPs (referred to 
in the table as PTP 38 and PTP 49) were also isolated. One is a receptor PTP which is related to the 

20 homotypically interacting u., k and LAR family and is the subject of a patent application filed concurrently 
herewith. The second PTP was found to be most homologous to two previously described non-receptor PTPs, 
murine PTP PEP (Matthews et aL, Mol. Cell Biol. 12(5), 2396-2405 [1992]) and murine/human PTP PEST 
(Takekawa et aL, Biochem. Biophvs. Res. Commun. I§2(2), 1223-1230 [1992]; Yang et aL, J. Biol. Chem. 
268(23) 17650 11993): and Chareste/ aL. Biochem J. 308 (2\ 425-432 [1995]), both of which contain a region 

25 that is very high in proline, glutamate, serine and threonine (the "PEST 1 domain). One of these PTPs, PEP, has 
been demonstrated to be localized to the nucleus (Flores et aL y supra) (see below), so it appeared that the novel 
PTP fragment may have been a new member of this potentially nuclear-localized PTP family. 

Initial PCR and northern analyses with the PTP fragment revealed that the transcript encoding this 
enzyme is extremely rare in embryonic and adult tissues. Thus, the full length cDNA was cloned using the RACE 

30 procedure and RNA isolated from day 10 embryos. Because the RACE cloning of the 5 prime region was 
particularly difficult, the final 5 prime sequence was confirmed using the genomic clone encoding this PTP. As 
can be seen in figure 1, this transcript encodes an open reading frame of 453 amino acids specifying a protein 
of molecular weight 50,253 daltons. Homology searches revealed that the region encoding amino acids 25-290 
were highly homologous to a variety of PTPs, with the highest degree of homology with murine PTP PEP 

35 (Matthews et aL, supra) and murine/human PTP PEST (Takekawa et aL, supra; Yang et aL, supra; and Charcst 
etaL t supra) (figure 2). Interestingly, PTP PEP has also been found to be expressed in mature hematopoietic 
cells (Matthews et aL t supra, Flores et aL, supra) although human and murine PTP PEST appear to have a more 
generalized expression pattern (Yang et aL, supra; Charest et aL, supra). As has been shown in these two 
previously described PTPs, the novel PTP reported here contains a region 3 prime of the PTP domain which is 
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very rich in proline, serine, and threonine (-29%) (boxed residues in figure I ). This region lacks other significant 

homology with PTPs PEP and PEST, and it is also much shorter in the novel PTP described here. Finally, a 

short region of 20 amino acids at the very carboxy terminus of the protein is highly homologous to similar 

carboxy-terminal regions in PTPs PEP and PEST (figure 2). This region is rich in basic residues and the 

5 homologous area in PTP PEP has been shown to be involved with the localization of this enzyme to the nucleus 

(Flores et ai. supra). However, this region also contains two negatively charged residues (arrowheads in figure 

2). so it is likely that this novel PTP is a cytoplasm ically localized enzyme, as has been demonstrated for PTP 

PEST (Charest et ai. supra). Finally, the novel PTP described here contains a serine residue at position 37 

(shown starred in figure 2) which is conserved in all three members of this family and which has been shown to 

10 be phosphorylated in PTP PEST by protein kinases C and A (Garton and Tonks, EMBO J. 11(16), 3763-71 
[1994]). Interestingly, increased phosphorylation at this site is inhibitory to the PTPase activity of this PTP 
( Ban v i 1 le et ai . Genom ics 2 1( W 165-173 [1995]). In summary, the novel PTP described here appears to be a 
new member of a family of non-receptor PTPs which contain P, S and T rich regions (figure 3), In addition, all 
three of these PTPs contain a homologous carboxy-terminal region which has been shown to function as a nuclear 

1 5 localization signal for one of the family members (PTP PEP), although the murine PEST enzyme has been found 
to localize to the cytoplasm. 

Previous analyses of the genomic structures of other PTPs suggested that these enzymes were 
constructed from genes containing a large number of introns. This appears to be the case for the novel PTP 
described here as well. As can be seen from figure 4, the hematopoietic progenitor cell PTP gene is subdivided 

20 by 14 introns. Analysis of the tntronic structure of this novel PTP as compared with that found for other PTPs 
suggests that the novel progenitor cell enzyme is divided into a comparable number of coding exons (for 
example, Banville et ai, supra). In addition, as described below, there appears to be at least one other smaller 
transcript, as well as a heterogeneous collection of large transcripts, suggesting that alternate splicing may occur 
in this gene. Finally, chromosomal localization studies have demonstrated that the gene encoding the human 

25 form of this PTP is found on chromosome 14 (D. Dowbenko and L. Lasky, unpublished data). 

While the sequence of the N-terminal PTP domain contained many of the conserved amino acids found 
to be critical for substrate recognition and tyrosine dephosphorylation (Jia et al. % supra), it was important to 
demonstrate that this sequence indeed encoded an active PTP domain . To this end we produced a construct using 
the glutathione-S-transferase (GST) fusion system which contained the entire PTP-homologous region derived 

30 from the novel cDNA clone. The protein was isolated from induced cultures of bacteria, and it was tested for 
the dephosphorylation of tyrosine using two different phosphorylated peptides (see materials and methods). As 
can be seen from figure 5, the isolated GST-PTP domain fusion protein had a very high level of PTP activity, 
with significant dephosphorylation at only 20 picograms of enzyme per reaction, which was partially sensitive 
to inhibition by orthovanadate. The only partial inhibition of enzyme activity by orthovanadate was likely due 

35 to the high level of activity as well as insufficient levels of the inhibitor. These data indicate that this 
hematopoietic progenitor cell PTP is an active tyrosine phosphatase. 

C. Expression of the progenitor cell PTP transcript 

The isolation of the novel PTP from the lin*°CD34 m sca^' population of hematopoietic stem cells 
suggested that this PTP might be specific for very early progenitor cells. As figure 6A illustrates, quantitative 
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PCR comparing the levels of the transcript encoding this PTP in the lin lo CD34 m sca n \ a largely undifferentiated 
population containing hematopoietic stem cells (Spangrude et aL, supra, Krause et al, supra: Zeigler et a/.. 
Blood £4(8), 2422-2430 [1994]), versus the lin Io CD34 ni sca l0 population, a more differentiated cell population 
(Spangrude et al., supra), containing committed progenitors, demonstrated that there was an approximately 1 0 
5 fold lower level of the transcript in the more differentiated sca io cells. In order to examine if this 
downregulation continued as differentiation progressed, quantitative PCR was performed using RNA isolated 
from suspension cultures of lin lo CD34 m sca m cells that were exposed to IL-l, IL-3, EPO and GM-CSF for 
various periods of time in the absence of stromal cells. Analysis of cell numbers, together with Wright-Giemsa 
staining of the cultures, revealed that the undifferentiated lin*°CD34 m sca ni cell population dramatically 

10 expanded in the presence of these growth and differentiation factors and also metamorphosed along the myeloid 
pathway to ultimately give rise to cultures that contained predominately macrophages after 14 days (data not 
shown). As figure 6B illustrates, the transcript encoding the novel PTP disappears as the cells replicate and 
develop, and it is completely absent after approximately 7 days in culture. These data are consistent with a role 
for this PTP in early stem or progenitor cells, but not in the mature, committed cell populations. 

1 5 The potential importance of this PTP specifically to the hematopoietic system is illustrated in figure 7A 

where northern blot analyses of various tissues and cell lines are shown. As can be seen from this figure, the 
transcript appears to be undetectable in the embryonic samples, and it is expressed at exceedingly low levels in 
adult lung and kidney. Thus, while there are clearly hematopoietic stem cells in the embryo, they must be so rare 
as to not allow for the direct detection of the transcript encoding the novel PTP. Particularly interesting is the 

20 lack of a signal in the RNA isolated from the adult spleen, a hematopoietic compartment that contains 
predominately mature, differentiated hematopoietic cells and which was previously shown to express PTP PEP 
(Matthews et al., supra). The very faint transcripts detected in the lung have been confirmed by non-quantitative 
PCR analysis (J. Cheng and L. Lasky-unpublished data). However, the transcripts in the lung are very rare and 
may be aberrant, since screening of an adult lung library ( ixlO^ clones) resulted in only two positive isolates, 

25 both of which contained introns (J. Cheng and L. Lasky-unpublished observations). 

The lack of detectable signal in most tissues of the adult and embryo, coupled with the identification 
of the transcript in the highly purified stem cell population, but not in the differentiated hematopoietic cells, 
suggested that this PTP might be expressed in hematopoietic progenitor cell lines. As figure 7B illustrates, the 
transcripts encoding this novel PTP are easily detectable in the three different murine hematopoietic progenitor 

30 cell lines tested by both northern and PCR analyses. In all three cases, these lines represent relatively 
undifferentiated precursors of mature hematopoietic cells, although they are certainly not self-renewing stem 
cells. The cells appear to encode two major transcripts, in addition to a diversity of minor transcripts. One major 
transcript is an -1 .8 kB RNA that corresponds to the cDNA clone described above, while the other encodes a 
-0.7 kB RNA that remains to be characterized. However, it is likely that this smaller transcript is due to 

35 alternative splicing, since, as described above, the gene encoding this PTP is divided into a large number of 
exons (Figure 4). Figure 7C illustrates that the PTP HSC transcript is undetectable by PCR in a differentiated 
T cell clone, a result which is again consistent with the downregulation of this PTP in differentiated cells. Finally, 
PCR analysis of various human cell lines using the murine primer pair revealed expression of a similarly sized 
fragment in human CMK progenitor cells, and the sequence of this PCR fragment revealed that the human 
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homologue is highly conserved with the murine PTP (J. Cheng, Kai Wu and L. Lasky-unpublished results). In 

summary, the novel PTP described here appears to be expressed predominately in very early hematopoietic 

progenitor cells, consistent with a potential role in the regulation of the differentiation state of these cells. 

D. Discussion 

5 The ability of the hematopoietic stem cell to self renew in the absence of differentiation is an important 

factor which allows for this cell to provide a large number of progeny throughout the lifetime of the organism. 
The maintenance of the undifferentiated state must occur at the same time as the stem cell replicates, since this 
cell type must be continually replenished. Thus, there must be specific mechanisms that decrease some aspects 
of cellular activation, such as differentiation, while not affecting others, such as division. Because tyrosine 

10 phosphorylation is a critical aspect of cellular activation, based upon the results disclosed herein, it is likely that 
distinctive mechanisms which regulate tyrosine phosphorylation are involved with the maintenance of the self 
renewing stem cell. Such specificity can be accomplished in part by the expression of the appropriate growth 
factors by the hematopoietic cell stroma. However, another means by which such regulation can occur is by the 
dephosphorylation of a subset of tyrosine phosphorylated proteins. One mechanism that would allow for specific 

1 5 dephosphorylation is via PTPs which recognize only a fraction of the tyrosine phosphorylated proteins in the cell. 
Thus, the analysis of PTPs expressed by hematopoietic stem cells might further our understanding of the 
mechanisms by which stem cell self renewal is attained. The non-receptor PTP described in the present 
application has some of the features that might be expected for a regulator of stem cell differentiation. 

Several aspects of this novel PTP, which is referred to throughout the specification and claims as the 

20 PTP of hematopoietic stem cells or PTP HSC, are consistent with a role in the regulation of aspects of early 
hematopoietic progenitor cell biology. First, the specific expression of the transcript in very early hematopoietic 
progenitor cells, together with the down-regulation of the message as the cells differentiate, is compatible with 
a role for this enzyme in physiological aspects of the less differentiated stem cell. While little is understood 
regarding the regulation of genes in very early hematopoietic progenitor cells, the apparently unique expression 

25 of this gene predominately in these comparatively undifferentiated cells suggests that novel mechanisms of 
transcriptional regulation might be utilized in the control of this locus (Orkin, Curr. Opin. Cell Biol. 7(6), 870- 
877 [1995]). In addition, the predominate lack of expression of this PTP in most adult tissues, with the exception 
of extremely low levels in the lung and the kidney, is also consistent with a role for this enzyme specifically 
within the hematopoietic progenitor cell compartment. This is in stark contrast to the expression of PTP PEP, 

30 which is found in the lymphoid compartment (Takekawa et a!., supra), and PTP PEST, which is apparently 
ubiquitously expressed in a number of cell lines and tissues (Yang et al., supra). Second, the PTP domain can 
be thought of as a moderator of cell activation by virtue of its ability to dephosphorylate tyrosine residues. 
Tyrosine phosphorylation can either up- or down-regulate the activities of various proteins (Fantl el al.. supra) % 
so that the PTP HSC might activate or inhibit a specific subset of tyrosine phosphorylated proteins. In a cell that 

35 requires a down-regulation of differentiation, this type of specific modulation would allow for the control of the 
phosphotyrosine levels of proteins activated by various growth factors produced by the hematopoietic stroma. 
Together, these data are compatible with a function for this enzyme in the modulation of development of the stem 
cell that is induced by the various growth factors produced by the hematopoietic microenvironment. 
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The hypothesis that PTPs such as PTP HSC are involved with the maintenance of an undifferentiated 

state in the hematopoietic stem cell suggests possibilities regarding the substrates recognized by this type of PTP. 

Several of the substrates for the PTPs have been previously characterized. For example, the alpha PTP. a 

receptor PTP, has been found to regulate the levels of src tyrosine phosphorylation which results in 

5 differentiation of neuronal progenitor cells. Lar, as well as CD45, are apparently involved with the regulation 

of the tyrosine phosphorylation levels of the insulin receptor (Kulas et ai, J. Biol, Chem. . 271 (2). 748-754 

(1996); Kulas et ai. J. Biol. Chem. 27 \(2\ 755-760 { 1 996]). From the standpoint of hematopoiesis, the SH 2 

domain containing PTP 1C phosphatase has been shown to be critically involved with the regulation of myeloid 

development in the motheaten mouse as well as with the activation state of the EPO receptor (Schulz et ai. 

10 supra, McCulloch (Klingmuller et ai, supra). Finally, another SH2-containing PTP, PTP 1 D has been found 
to positively regulate the activity of the prolactin receptor (Ali et ai, EMBO J. 15(11 135-142 [1996]). These 
examples, among others, are consistent with a role for cytoplasmically -localized PTP domains in the regulation 
of a variety of cellular processes. However, the nature of the substrates recognized by the rarer nuclear PTP 
family is unknown. The dual specificity (i.e. tyrosine and serine/threonine dephosphorylation) phosphatase 

15 encoded by the cdc25 locus is a nuclear enzyme that is critical for the regulation of mitosis (Gautier et ai. Cell 
62(1), 197-211 [1991]). In addition, PAC-1 , another nuclear localized PTP, appears to be involved with the 
regulation of the mitogen activated protein kinases. A recently described dual specificity phosphatase, TYP 1 . 
related to the vaccinia virus VH 1 phosphatase, appears to be involved with the regulation of both the ERK and 
JNK family of mitogen activated protein kinases (King et ai., Oncogene 1 1 . 2553-2563 [1995]). These data 

20 suggest that several currently described phosphatases appear to play roles in the regulation of tyrosine 
phosphorylated nuclear proteins. 

Another possible substrate for both the nuclear and cytoplasmic PTP enzymes are the STAT proteins. 
These transcriptional activators encompass a family of at least 6 different members, all of which are activated 
by the JAK tyrosine kinases (Darnell et ai. Science 264 (5164). 141501421 [1994]; Ihle et ai, Annu. Rev. 

25 Immunol. JJ, 369-398 [1995]). JAK phosphorylation is stimulated by the formation of receptor complexes that 
are stimulated by the binding of various hematopoietic and other growth factor-like molecules (Darnell et ai, 
supra). The phosphorylated STAT proteins than dimerize, migrate to the nucleus and bind specifically to various 
DNA elements that regulate the transcription of growth and differentiation genes (Shuai ei ai t Science 
261(5129). 1744-1746 [1993]; Heim et ai t Science 262(5202), 1347-49 [1995]). Thus, because these 

30 transcription factors are linked with the activation of hematopoietic differentiation factors, they provide appealing 
targets for negative regulation in hematopoietic stem cells. The absolute requirement for tyrosine 
phosphorylation of these transcriptional activators thus suggests that the novel PTP reported here could regulate 
STAT activation via dephosphorylation of tyrosine residues. In this manner, the upregulation of genes specific 
to the differentiated state could be inhibited by the dephosphorylation of one or more activated STAT molecules. 

35 This hypothesis is especially appealing in the case of the hematopoietic stem cells. In this case, the activation 
of the STAT proteins by the binding of various hematopoietic growth and differentiation factors, a state which 
would induce terminal differentiation, could be downregulated by a stem cell specific PTP such as PTP HSC. 
If this hypothesis is correct, the manner by which specific STAT dephosphorylation occurs must be investigated. 
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However, it is possible that the proline, serine, threonine rich domain of PTP HSC might function to bind to only 

a subset of STATs. 

Finally, recent data have shown that PTP PEST can associate with the p52 snc and p66 snc SH2- 
containing adaptorproteins in a protein kinase C dependent fashion (Habib et ai, J. Biol. Chem. 269 (4 1 ), 25243- 
5 25246 [1994]). This association was through an interaction between the N-terminal region of SHC and the 
carboxy-terminal P,S,T rich region of the PTP PEST. The fact that this association was enhanced by protein 
kinase C suggested that serine or threonine phosphorylation might be involved, and a serine in the P,S,T rich 
region of PTP PEST is known to be phosphorylated by protein kinase C (Garton and Tonka, supra). 
Interestingly, carbachol, an activator of G protein coupled signaling, was also able to stimulate this association, 
1 0 suggesting that PTP PEST may be involved with the cross talk between G coupled and tyrosine kinase pathways 
Because of the similarity of PTP HSC to PTP PEST, we suggest that the novel hematopoietic cell PTP of the 
present invention may also interact with SHC, and we are currently examining this possibility using the yeast two 
hybrid system. 

In summary, the data disclosed in this example suggest that hematopoietic stem/progenitor cells 
1 5 specifically express a PTP which appears to be downregulated as the cells differentiate. The PTP seems to be 
predominately specific to hematopoietic progenitor cells, suggesting an important role in the development of this 
cell compartment. However, while these data are potentially important, a number of studies remain to be 
accomplished. Thus, the possibility that the STATs are substrates for this enzyme, the possible interaction of 
the enzyme with SHC, the constitutive expression of the enzyme in transfected cells and in transgenic animals, 
20 and the effects of null mutations at this locus in vivo may provide for further insights into the mechanisms by 
which stem cell self renewal is regulated. 
Example 2 

Cloning of a human PTP HSC 

Two oligonucleotides (sense: 5 , ACTTGGTGAGGAGCTTCTTGGAGCAGCTGGAGG3' (SEQ. ID. 

25 NO: 20), and antisense: 5'GGAATGTAACCTGGAGGGTCCTGA3' (SEQ. ID. NO: 2!)) were used as PCR 
primers with reverse transcribed RNA isolated from human CMK hematopoietic progenitor cells. The conditions 
for PCR were identical to those described in Example 1 for the isolation of the PCR fragment encoding murine 
PTP HSC. The PCR fragment was subcloned into pBS (Bluescript) plasmid, and the DNA sequence was 
determined as described for the murine sequence in Example 1. The partial nucleotide sequence and deduced 

30 amino acid sequence of the human PTP HSC are shown in Figure 8. 

Example? 

Expression of the murine and human PTP HSC 

The native murine PTP HSC polypeptides are expressed in mammalian cells using standard techniques. 
Briefly, a DNA fragment encoding the entire PTP HSC is ligated into an expression vector (e.g. PRK5). The 
35 expression vector is then transfected into mammalian cells (e.g. embryonic kidney 292 cells), and the protein 
expression is determined using a monoclonal or polyclonal antibody directed against the native PTP HSC to be 
expressed. 

All documents cited throughout this application as well as the documents cited therein are hereby 
expressly incorporated by reference. 
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Table 1 

PTPs expressed in lin*° CD34^' hematopoietic progenitor cells 
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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANT: Genentech, Inc. 
(ii> TITLE OF INVENTION: Protein Tyrosine Phosphatases 
(iii) NUMBER OF SEQUENCES: 23 



(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Genentech, Inc. 

(B) STREET: 460 Point San Bruno Bivci 

(C) CITY: South San Francisco 
10 ( 0} STATE: California 

( E } COUNTRY: USA 

(F) ZIP: 94080 



(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: 3.5 inch, 1.44 Mb floppy disk 
15 (B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC- DOS /MS- DOS 

(D) SOFTWARE: WinPatin (Genentech) 



(vi) CURRENT APPLICATION DATA: 
(A) APPLICATION NUMBER: 
20 (B) FILING DATE: 

(C) CLASSIFICATION : 



(viii) ATTORNEY/ AGENT INFORMATION: 

(A) NAME: Dreger, Ginger R. 

(B) REGISTRATION NUMBER: 33,055 

25 (C) REFERENCE/ DOCKET NUMBER : P1C10PCT 



(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 415/225-3216 

(B) TELEFAX: 415/952-9881 

(C) TELEX: 910/371-7168 



30 (2) INFORMATION FOR SEQ ID NO : 1 : 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1529 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNES3 : Single 
35 (D) TOPOLOGY: Linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 : 



CTCAGAGCGG GTCGCAGCAT GAGTCGCCAT ACGGACTTGG TGAGGAGCTT 50 



CTTGGAGCAG CTGGAGGCCC GGGACTACCG GGAGGGGGCA ATCTTCGTTC 100 

GTGAGTTCAG CGACATTAAG GCCCGCTCAG TGGCCTGGAA GTCTGAAGGT 150 

40 GTGTGTTCCA CTAAAGCCGG CAGTCGGCTT GGGAACACGA ACAAGAACCG 20 0 

CTACAAAGAT GTGGTAGCAT ATGATGAGAC AAGAGTCATC CTTTCCCTGC 250 

TCCAAGAGGA GGGACATGGA AATTACATCA ATGCCAACTT CATCCGGGGC 300 

ATAGATGGAA GCCAGGCCTA CATTGCGACG CAAGGACCCC TGCCTCACAC 350 



ACTGTTGGAC TTCTGGCGCC TGGTTTGGGA GTTTGGGGTC AAGGTAATCC 4 00 
45 TGATGGCCTG TCAAGAGACA GAAAATGGAC GGAGGAAGTG TGAACGCTAT 4 50 
TGGGCCCGGG AGCAGGAGCC TCTAAAGGCT GGGCCTTTCT GCATCACCCT 500 
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GACAAAGGAG ACAACACTGA ATGCAGACAT CACTCTCAGG ACCCTCCAGG 5 50 

TTACATTCCA GAAGGAATTC CGCTCTGTGC ACCAACTACA GTATATGTCG 600 

TGGCCAGACC ACGGGGTTCC CAGCAGTTCT GATCACATTC TCACCATGGT 650 

GGAGGAGGCC CGCTGCCTCC AAGGGCTTGG ACCTGGACCC CTCTGTGTCC 700 

5 ACTGCAGTGC TGGCTGCGGA CGAACAGGTG TCCTGTGCGC TGTTGACTAT 7 50 

GTGAGGCAGT TGCTGCTGAC CCAGACAATC CCTCCCAACT TCAGTCTCTT 300 

CCAAGTGGTC CTGGAGATGC GGAAACAGCG GCCTGCAGCA GTGCAGACAG 8 50 

AGGAGCAGTA CAGGTTCCTG TACCACACAG TGGCTCAGCT ATTCTCCCGC 900 

ACTCTCCAGG ACACCAGCCC CCAATACCAG AACCTCAAGG AGAACTGCGC 950 

10 TCCAATCTGC AAGGAAGCTT TCTCCCTCAG GACCTCCTCA GCCCTGCCTG 1000 

CCACATCCCG GCCACCAGGA GGGGTTCTCA GGAGCATCTC GGTGCCTGCG 10 50 

CCCCCGACCC TCCCCATGGC TGACACTTAC GCTGTGGTGC AGAAGCGTGG 1100 

CGCTTCGGCG GGCACAGGGC CGGGGCCGCG GGCGCCCACC AGCACGGACA 1150 

CCCCGATTTA CAGCCAGGTG GCTCCACGTG CCCAGCGACC GGTGGCACAC 1200 

15 ACGGAGGACG CACAGGGGAC AACGGCACTG CGCCGAGTTC CTGCGGACCA 12 50 

AAACTCTTCC GGGCCTGATG CCTACGAAGA AGTAACAGAT GGAGCACAGA 1300 

CTGGAGGGCT AGGCTTCAAC TTGCGCATCG GAAGGCCCAA AGGGCCCCGG 13 50 

GATCCTCCAG CAGAGTGGAC ACGGGTGTAA CGAGTGCTGT GCCAGTTATA 14 00 

GCCTGCCACT CGGTGGTGGC TGGACTCCTG GAACCACCAT ACTGCTGTGC 14 50 

20 AGTGTGTTAT GTATGAGTGG GACTTGTGGG CCTGATTCAA AATAAAAGTT 1500 

TCTCAGGGCG GAAAAAAAAA AAAAAAAAA 152 9 

(2) INFORMATION FOR SEQ ID NO : 2 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 53 amino acids 
25 (B) TYPE: Amino Acid 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 : 

Met Ser Arg His Thr Asp Leu Val Arg Ser Phe Leu Glu Gin Leu 
1 5 10 15 

30 Giu Aia Arg Asp Tyr Arg Giu Gly Ala lie Phe Val Arg Glu Phe 

20 25 30 

Ser Asp lie Lys Ala Arg Ser Val Ala Trp Lys Ser Glu Gly Val 
35 40 45 

Cys .Ser Thr Lys Ala Gly Ser Arg Leu Gly Asn Thr Asn Lys Asn 
35 50 55 60 

Arg Tyr Lys Asp Val Val Ala Tyr Asp Glu Thr Arg Val lie Leu 

65 70 75 
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Ser Leu Leu Gin Glu Glu Gly His Gly Asn Tyr lie Asn Ala Asn 

80 85 9C 

Phe lie Arg Gly lie Asp Gly Ser Gin Ala Tyr lie Aia Thr Gin 

95 100 105 

5 Gly Pro Leu Pro His Thr Leu Leu Asp Phe Trp Arg Leu Vai Trp 

110 115 120 

Glu Phe Gly Vai Lys Vai lie Leu Met Ala Cys Gin Glu Thr Glu 

125 130 135 

Asn Gly Arg Arg Lys Cys Glu Arg Tyr Trp Ala Arg Glu Gin Glu 

0 140 145 150 

Pro Leu Lys Ala Gly Pro Phe Cys lie Thr Leu Thr Lys Glu Thr 

155 160 165 

Thr Leu Asn Ala Asp lie Thr Leu Arg Thr Leu Gin Vai Thr Phe 

170 175 180 

5 Gin Lys Glu Phe Arg Ser Vai His Gin Leu Gin Tyr Met Ser Tro 

185 190 195 

Pro Asp His Giy Vai Pro Ser Ser Ser Asp His lie Leu Thr Met 

200 205 210 

Vai Glu Glu Ala Arg Cys Leu Gin Gly Leu Gly Pro Gly Pro Leu 

0 215 220 225 

Cys Vai His Cys Ser Ala Gly Cys Gly Arg Thr Gly Vai Leu Cys 

230 235 240 

Ala Vai Asp Tyr Vai Arg Gin Leu Leu Leu Thr Gin Thr lie Pro 

245 250 255 

5 Pro Asn Phe Ser Leu Phe Gin Vai Vai Leu Glu Met Arg Lys Gin 

260 265 270 

Arg Pro Ala Ala Vai Gin Thr Glu Glu Gin Tyr Arg Phe Leu Tvr 

275 280 285 

His Thr Vai Ala Gin Leu Phe Ser Arg Thr Leu Gin Asd Thr Ser 

0 290 295 " 300 

Pro Gin Tyr Gin Asn Leu Lys Glu Asn Cys Ala Pro lie Cys Lys 

305 310 315 

Glu Ala Phe Ser Leu Arg Thr Ser Ser Ala Leu Pro Ala Thr Ser 

320 325 330 

5 Arg Pro Pro Gly Gly Vai Leu Arg Ser lie Ser Vai Pro Ala Pro 

335 340 345 

Pro Thr Leu Pro Met Ala Asp Thr Tyr Ala Vai Vai Gin Lys Arg 

350 355 360 

Gly Ala Ser Ala Gly Thr Gly Pro Gly Pro Arg Ala Pro Thr Ser 

0 365 370 375 

Thr Asp Thr Pro lie Tyr Ser Gin Vai Ala Pro Arg Ala Gin Arg 

380 385 390 

Pro Vai Ala His Thr Glu Asp Ala Gin G.l y Thr Thr Ala Leu Arq 

395 400 405 
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Arg Val Pro Ala Asp Gin Asn Ser Ser Giy Pro Asp Ala Tyr Glu 
410 415 420 

Glu Val Thr Asp Gly Ala Gin Thr Giy G.iy Leu Gly Phe Asn Leu 
425 430 435 

5 Arg lie Gly Arg Pro Lys Gly Pro Arg Asp Pro Pro Ala Glu Trp 

440 445 450 

Thr Arg Val 
453 

(2) INFORMATION FOR SEQ ID NO : 3 : 

10 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 39 base Dairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 : 

CACGGTCGAC GGTGAGGAGC TTCTTTGAGC AGCTGGAGG 3 9 

(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 42 base pairs 
20 (B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 : 
GTTGCGGCCG CGATTGGAGC GCAGTTCTCC TTGAGGTTCT GG 4 2 
25 (2) INFORMATION FOR SEQ ID NO : 5 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 
30 (D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

CCTGGAGGGT CCTGAGAGTG ATGTCTGCAT TCAGTG 36 

(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 
35 (A) LENGTH: 34 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 6 : 
40 CCTCTTGGAG CAGGGAAAGG ATGACTCTTG TCTC 34 
(2) INFORMATION FOR SEQ ID NO : 7 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: Nucleic Acid 
45 (C) STRANDEDNESS: Single 
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(>;i) SEQUENCE DESCRIPTION: SEQ ID NO : 7 : 
CAGCTGCTCC AAGAAGCTCC TCACCAAGTC 30 
(2) INFORMATION FOR SEQ ID NO : 8 : 

5 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: Nucleic Acid 
JO STRANDEDNESS : Single 
(D) TOPOLOGY: Linear 

10 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

GGTAGAGGTG GGCAGGGTGA AGTGTTCTCG C 31 

(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 36 base pairs 
15 (B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 9 : 
CACTGAATGC AGACATCACT CTCAGGACCC TCCAGG 3 6 
20 (2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 
25 ( D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

GAGACAAGAG TCATCCTTTC CCTGCTCCAA GAGG 34 

(2) INFORMATION FOR SEQ ID NO : 1 1 : 

(i) SEQUENCE CHARACTERISTICS: 
30 (A) LENGTH: 25 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 
35 GAATGGTAAC CTGGAGGGTC CTGAG 2 5 

(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: Nucleic Acid 
40 (C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 
GAGAAGGTCG TGTTCGAG 18 
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(2) INFORMATION FOR SEQ ID NO : 1 3 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: Nucleic Acid 

5 (C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

GTGTACTTCC TGTGCCTG 18 

(2) INFORMATION FOR SEQ ID NO: 14: 

10 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

ANTTNTGGNG ATGNTTGG 18 

(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 15 base pairs 
20 ( B ) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:15: 

GGACNNNNTC GGCCA 15 
25 (2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 466 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 
30 (D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 6 : 

GCGCGGGGCG GCCGGGAGGG GGCAGTCCTC GCCGGCGAGT TCAGCGACAT 50 

CCAGGCCTGC TCGGCCGCCT GGAAGGCTGA CGGCGTGTGC TCCACCGTGG 100 

CCGGCAGTCG GCCAGAGAAC GTGAGGAAGA ACCGCTACAA AGACGTGCTG 150 

35 CCTTATGATC AGACGCGAGT AATCCTCTCC CTGCTCCAGG AAGAGGGACA 2 00 

CAGCGACTAC ATTAATGGCA ACTTCATCCG GGGCGTGGAT GGAAGCCTGG 2 50 

CCTACATTGC CACGCAAGGA CCCTTGCCTC ACACCCTGC7 AGACTTCTGG 300 

AGACTGGTCT GGGAGTTTGG GGTCAAGGTG ATCCTGATGG CCTGTCGAGA 350 

GATAGAGAAT GGGCGGAAAA GGTGTGAGCG GTACTGGGCC CAGGAGCAGG 4 00 

40 AGCCACTGCA GACTGGGCTT TTCTGCATCA CTCTGATAAA GGAGAAGTGG 4 50 
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CTGAATGAGG ACATCA 4 66 

(2) INFORMATION FOR SEQ ID NO: 17: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 155 amino acids 
5 (B) TYPE : Amino Acid 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 7 : 

Ala Arg Gly Gly Arg Glu Gly Ala Val Leu Ala Gly Glu Phe Ser 

lb 10 15 

10 Asp lie Gin Ala Cys Ser Ala Ala Trp Lys Ala Asp Gly Val Cys 

20 25 3C 

Ser Thr Val Ala Gly Ser Arg Pro Glu Asn Val Arq Lys Asn Arg 

35 40 " 45 

Tyr Lys Asp Val Leu Pro Tyr Asp Gin Thr Arg Val lie Leu Ser 

15 50 55 60 

Leu Leu Gin Glu Glu Gly His Ser Asp Tyr He Asn Gly Asn Phe 

65 70 75 

He Arg Gly Val Asp Gly Ser Leu Ala Tyr lie Ala Thr Gin Gly 

80 85 90 

20 Pro Leu Pro His Thr Leu Leu Asp Phe Trp Arg Leu Val Trp Glu 

95 100 105 

Phe Gly Val Lys Val He Leu Met Ala Cys Arg Glu He Glu Asn 

110 . 115 120 

Gly Arg Lys Arg Cys Glu Arg Tyr Trp Ala Gin Glu Gin Glu Pro 

25 125 130 135 

Leu Gin Thr Gly Leu Phe Cys lie Thr Leu He Lys Glu Lys Trp 

140 145 150 

Leu Asn Glu Asp He 
155 

30 (2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 278 amino acids 

(B) TYPE: Amino Acid 
(D) TOPOLOGY: Linear 

35 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 

Phe Ala Ser Glu Phe Leu Lys Leu Lys Arg Gin Ser Thr Lys Tyr 

15 10 15 

Lys Ala Asp Lys He Tyr Pro Thr Thr Val Ala Gin Arg Pro Lys 

20 25 30 

40 Asn lie Lys Lys Asn Arg Tyr Lys Asp He Leu Pro Tyr Asp His 

3 5 4 0 4 5 

Ser Leu Val Glu Leu Ser Leu Leu Thr Ser Asp Glu Asp Ser Ser 

50 55 60 

Tyr He Asn Ala Ser Phe He Lys Giv Val Tyr Gly Pro Lys Ala 

45 65 7:; 7b 
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Tvr lie Ala Thr Gin Gly Pro Leu Ser Thr Thr Leu Leu Asp Phe 

80 85 90 

Trp Arg Met lie Trp Glu Tyr Arg He Leu Val He Val Met Ala 

95 10C 105 

5 Cvs Met Glu Phe Glu Met Gly Lys Lys Lys Cys Glu Arg Tyr Trp 

110 115 120 

Ala Glu Pro Gly Glu Thr Gin Leu Gin Phe Gly Pro Phe Ser He 

125 130 135 

Ser Cys Glu Ala Glu Lys Lys Lys Ser Asp Tyr Lys lie Arg Thr 

10 140 145 150 

Leu Lys Ala Lys Phe Asn Asn Glu Thr Arg He He Tyr Gin Phe 

155 160 165 

His Tyr Lys Asn Trp Pro Asp His Asp Val Pro Ser Ser He Asp 

170 ' ' 175 180 

15 Pro He Leu Gin Leu He Trp Asp Met Arg Cys Tyr Gin Glu Asp 

185 190 195 

Asp Cys Val Pro He Cys He His Cys Ser Ala Gly Cys Gly Arg 

200 205 210 

Thr Gly Val lie Cys Ala Val Asp Tyr Thr Trp Met Leu Leu Lys 

20 215 220 225 

Asp Gly He He Pro Lys Asn Phe Ser Val Phe Asn Leu He Gin 

230 235 240 

Glu Met Arg Thr Gin Arg Pro Ser Leu Val Gin Thr Gin Glu Gin 

245 250 255 

25 Tyr Glu Leu Val Tyr Ser Ala Val Leu Glu Leu Phe Lys Arg His 

260 265 270 

Met Asp Val He Ser Asp Asn His 

275 278 

(2) INFORMATION FOR SEQ ID NO : 1 9 : 

30 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 272 amino acids 

(B) TYPE: Amino Acid 
(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 9 : 

35 Phe Ala Arg Asp Phe Met Arg Leu Arg Arg Leu Ser Thr Lys Tyr 

1 5 10 15 

Arg Thr Glu Lys He Tyr Pro Thr Ala Thr Gly Glu Lys Glu Glu 

20 25 _ 30 

Asn Val Lys Lys' Asn Arg Tyr Lys Asp He Leu Pro Phe Asp His 

40 35 4 0 4 5 

Ser Arg Val Lys Leu Thr Leu Lys Thr Pro Ser Gin Asp Ser Aso 

50 55 60 

Tyr He Asn Ala Asn Phe He Lys Gly Val Tyr Gly Pro Lys Ala 

65 ■ 70 7b 
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Tyr Val Ala Thr Gin Gly Pro Leu Ala Asn Thr Val lie Asp Phe 

8C 85 90 

Trp Arg Met Val Tro Glu Tyr Asn Val Val lie lie Val Met Ala 

95 100 105 

5 Cys Arg Glu Phe Glu Met Gly Arg Lys Lys Cys Glu Arq Tyr Trp 

110 115 " 12Q 

Pro Leu Tyr Gly Glu Asp Pro lie Thr Phe Ala Pro Phe Lvs lie 
125 130 ' 135 

Ser Cys Glu Asp Glu Gin Ala Arg Thr Asp Tyr Phe lie Arg Thr 
10 140 145 150 

Leu Leu Leu Glu Phe Gin Asn Glu Ser Arg Arg Leu Tyr Gin Phe 
155 160 165 

His Tyr Val Asn Trp Pro Asp His Asp Val Pro Ser Ser Phe Asp 
170 175 180 

15 Ser lie Leu Asp Met lie Ser Leu Met Arg Lys Tyr Gin Glu His 

185 190 195 

Glu Asp Val Pro He Cys He His Cys Ser Ala Gly Cys Gly Arg 
200 205 210 

Thr Gly Ala He Cys Ala He Asp Tyr Thr Trp Asn Leu Leu Lys 
20 215 220 225 

Ala Gly Lys lie Pro Glu Glu Phe Asn Val Phe Asn Leu He Gin 
230 235 240 

Glu Met Arg Thr Gin Arg His Ser Ala Val Gin Thr Lys Glu Gin 

245 250 255 

25 Tyr Glu Leu Val His Arg Ala He Ala Gin Leu Phe Glu Lys Gin 

260 265 270 

Leu Gin 
272 

(2) INFORMATION FOR SEQ ID NO: 20: 

30 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

35 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 

ACTTGGTGAG GAGCTTCTTG GAGCAGCTGG AGG 3 3 

(2) INFORMATION FOR SEQ ID NO : 2 1 : 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 24 base pairs 
40 (B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 
GGAATGTAAC CTGGAGGGTC CTGA 2 4 
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{2} INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS : 
■A) LENGTH: 19 amino acids 
(S) TYPE: Amino Acid 
5 { D ) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:22: 

Gly Phe Gly Asn Arg Phe Ser Lys Pro Lvs Gly Pro Arg Asn Pro 
1 5 10 15 

Pro Ser Ala Trp 
10 19 

(2) INFORMATION FOR SEQ ID NO : 2 3 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 amino acids 

(B) TYPE: Amino Acid 
15 (D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:23: 

lie Gly Phe Gly Asn Arg Cys Gly Lys Pro Lys Gly Pro Arg Asp 
1 5 10 15 

Pro Pro Ser Glu Trp Thr 
20 20 21 



BNSDOCID: <WO 973501 9A1> 



-45- 



WO 97/3501 9 PCT/US97/05278 

Claims: 

1 . An isolated non-receptor protein tyrosine phosphatase of hematopoietic stem cells (PTP HSC). 

which 

( 1 ) is expressed predominantly in early hematopoietic stem cells or progenitor cells; 
5 (2) predominantly lacks expression in adult tissues; 

(3) comprises an N-terminal tyrosine phosphatase domain, followed by a region rich in 
serine, threonine, and proline, and a carboxy terminal region of about 15 to 25 amino acids rich in basic amino 
acid residues; and 

(4) is capable of tyrosine dephosphorylation in hematopoietic stem cells or progenitor 

10 cells. 

2. The PTP HSC of claim I which is murine. 

3. The PTP HSC of claim I which is human. 

4. The PTP HSC of claim 1 or a derivative thereof, which downregulates STAT activation. 

5. An antagonist of the PTP HSC of claim 1 . 
15 6. An antagonist of the PTP HSC of claim 4. 

7. An isolated non -receptor protein tyrosine phosphatase of hematopoietic stem cells (PTP HSC) 
selected from the group consisting of: 

(1) a protein comprising the amino acid sequence shown in Figure 1 (SEQ. ID. NO:2); 

(2) a protein comprising the amino acid sequence shown in Figure 8 (SEQ. ID. NO: 17); 
20 (3) a mammalian homoiogue of protein ( I ) or protein (2); and 

(4) a derivative of proteins ( 1 ) - (2) retaining the ability of tyrosine dephosphorylation 
in hematopoietic stem cells or progenitor cells. 

8. The PTP HSC of claim 7 comprising an active N-terminal tyrosine phosphatase domain, 
retaining a serine residue at a position corresponding to amino acid position 37 in Figure I, a region rich in 

25 serine, threonine, and proline, retaining an active site cysteine residue at a position corresponding to amino acid 
position 229 in Figure 1, and a carboxy-terminal region showing at least about 80% sequence homology with 
the amino acid sequence between positions 430 and 45 1 in Figure 1 , said derivative having an at least about 65% 
overall sequence homology with the amino acid sequence shown in Figure 1 and retaining the ability of tyrosine 
dephosphorylation in hematopoietic progenitor cells. 
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9. The PTP HSC of claim 7, comprising the amino acid sequence shown in Figure 1 (SEQ. ID. 

NO; 2), or in Figure 8 (SEQ. ID. NO: 1 7). 

10. An antagonist of the PTP HSC of claim 7. 

11. An isolated nucleic acid molecule encoding the PTP HSC of claim 1 . 
5 12. An isolated nucleic acid molecule encoding the PTP HSC of claim 7. 

13. An isolated nucleic acid molecule encoding the PTP HSC of claim 1 1 . 

14. A vector comprising the nucleic acid molecule of claim I 1 operably linked to control 
sequences recognized by a host cell transformed with the vector. 

15. A host cell transformed with the vector of claim 13. 

10 16. An antibody capable of specific binding to the PTP HSC of claim 7. 

17. A hybridoma cell line producing an antibody of claim 15. 

1 8. An assay for identifying an antagonist or agonist of a PTP HSC of claim 1 , which comprises 
contacting the phosphatase domain of said PTP HSC with a candidate antagonist or agonist, and^monitoring 
the ability of said phosphatase domain to dephosphorylate tyrosine residues. 

15 19 An assay for identifying an antagonist or agonist of a PTP HSC of claim 1 , which comprises 

cultivating a PTP HSC-expressing hematopoietic stem or progenitor cell line in the presence of a candidate 
antagonist or agonist, and monitoring the differentiation of the stem or progenitor cells. 

20. A method for the differentiation of undifferentiated malignant hematopoietic cells, comprising 
contacting said cells with an antagonist of a PTP HSC according to claim 7. 

20 21. The method of claim 1 9 wherein said cells are leukemia cells. 

22. A method for the induction of differentiation of stem cells, comprising contacting said cells 
with an antagonist of a PTP HSC according to claim 7. 

23. A method for the expansion undifferentiated stems cells in cell culture, comprising cultivates 
stem cells in the presence of a PTP HSC according to claim 7 or an agonist antibody specifically binding a native 

25 PTP HSC. 

24. A method for the expansion of undifferentiated stem cells in vivo comprising administering 
to a patient an agonist of a PTP HSC according to claim 7 or an agonist antibody specifically binding a native 
PTP HSC, and a hematopoietic growth factor. 
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661 CGCTGCCTCC AAGGGCTTGG ACCTGGACCC CTCTGTGTCC ACTGCAGTGC TGGCTGCGGA 

215 RCLQ GLG PGP LCVH CSA GCG 

721 CGAACAGGTG TCCTGTGCGC TGTTGACTAT GTGAGGCAGT TGCTGCTGAC CCAGACAATC 

235 RTGV LCA VDY VRQL LLT QTI 

781 CCTCCCAACT TCAGTCTCTT CCAAGTGGTC CTGGAGATGC GGAAACAGCG GCCTGCAGCA 

255 PPNF SLF Q V V LEMR KQR PAA 

841 GTGCAGACAG AGGAGCAGTA CAGGTTCCTG TACCACACAG TGGCTCAGCT ATTCTCCCGC 

275 VQTE E Q Y RFL YHTV AQL FSR 

901 ACTCTCCAGG ACACCAGCCC CCACTACCAG AACCTCAAGG AGAACTGCGC TCCAATCTGC 

295 TLQD TSP HYQ NLKE N C A [g I C 

9 61 AAGGAAGCCT TCTCCCTCAG GACCTCCTCA GCCCTGCCTG CCACATCCCG GCCACCAGGA 
315KEAF 1 L R HUH AL[g|A E H R @ B G 

1021 GGGGTTCTCA GGAGCATCTC GGTGCCTGCG CCCCCGACCC TCCCCATGGC TGACACTTAC 

335 GVLR i I | V H A [| | @ L [p)MA D @ Y 

1081 GCTGTGGTGC AGAAGCGTGG CGCTTCGGCG GGCACAGGGC CGGGGCCGCG GGCGCCCACC 

355 A V V Q KRG a(Ua G £3 G (2 G 1 R A g] 0 

1141 AGCACGGACA ccccgatcta cagccaggtg gctccacgtg cccagcgacc ggtggcacac 

37 5 03 E D El IE I Y D Q V aBrA QR[g VAH 

1201 acggaggacg cacaggggac aacggcactg cgccgagttc ctgcggacca aaactcttcc 

395 E]eda q g E Hal rrvSadq n (s) H 

1261 gggcctgatg cctacgaaga agtaacagat ggagcacaga ctggagggct^aggcttcaac 

415 G H] D A YEE V 0 D G A Q (t) G Grp^j^F^^; . 



132 
4 



2 1 OTGCC^ : ' A&QGCCCCGG T4SATCCTCCAG CAGAGTGGAC^ ACGGGTGTAA 

35 tt^&. *'Sg&<*^K^J-^ ^M^^^Ej^S^^^lL^i^^^ R V O 

1381 CGAGTGCTGT GCCAGTTATA GCCTGCCACT CGGTGGTGGC TGGACTCCTG GAACC AC CAT 
14 41 ACTGCTGTGC AGTGTGTTAT GTATGAGTGG GACTTGTGGG CCTGATTCAA AATAAAAGTT 
1501 TCTCAGGGCA GAAAAAAAAA AAAAAAAAA 

Figure 1 
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Figure 5 
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Figure 6 A 
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Figure 7B 
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Figure 7C 
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GCGCGGGGCG GCCGGGAGGG GGCAGTCCTC 
CCAGGCCTGC TCGGCCGCCT GGAAGGCTGA 
CCGGCAGTCG GCCAGAGAAC GTGAGGAAGA 
CCTTATGATC AGACGCGAGT AATCCTCTCC 
C AGCGAC T AC ATTAATGGCA ACTTCATCCG 
CCTACATTGC CACGCAAGGA CCCTTGCCTC 
AGACTGGTCT GGGAGTTTGG GGTCAAGGTG 
GATAGAGAAT GGGCGGAAAA GGTGTGAGCG 
AGCCACTGCA GACTGGGCTT TTCTGCATCA 
CTGAATGAGG ACATCA 466 
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Figure 8A 

GCCGGCGAGT TC AGCGAC AT 5 0 
CGGCGTGTGC TCCACCGTGG 100 
ACCGCTACAA AGACGTGCTG 150 
CTGCTCCAGG AAGAGGGACA 200 
GGGCGTGGAT GGAAGCCTGG 2 50 
ACACCCTGCT AGACTTCTGG 3 00 
ATCCTGATGG CCTGTCGAGA 3 50 
GTACTGGGCC CAGGAGCAGG 4 00 
C TC TG AT AAA GGAGAAGTGG 450 
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Fiffure 8B 

Ala Arg Gly Gly Arg Glu Gly Ala Val Leu Ala Gly Glu Phe Ser 

15 10 15 

Asp lie Gin Ala Cys Ser Ala Ala Trp Lys Ala Asp Gly Val Cys 

20 25 30 

Ser Thr Val Ala Gly Ser Arg Pro Glu Asn Val Arg Lys Asn Arg 

35 40 45 

Tyr Lys Asp Val Leu Pro Tyr Asp Gin Thr Arg Val lie Leu Ser 

50 55 60 

Leu Leu Gin Glu Glu Gly His Ser Asp Tyr lie Asn Gly Asn Phe 

65 70 75 

lie Arg Gly Val Asp Gly Ser Leu Ala Tyr lie Ala Thr Gin Gly 

80 85 90 

Pro Leu Pro His Thr Leu Leu Asp Phe Trp Arg Leu Val Trp Glu 

95 100 105 

Phe Gly Val Lys Val lie Leu Met Ala Cys Arg Glu He Glu Asn 

110 115 120 

Gly Arg Lys Arg Cys Glu Arg Tyr Trp Ala Gin Glu Gin Glu Pro 

125 130 135 

Leu Gin Thr Gly Leu Phe Cys He Thr Leu He Lys Glu Lys Trp 

140 145 150 

Leu Asn Glu Asp lie 
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(57) Abstract 

This invention concerns new non-receptor protein tyrosine phosphatases of the hematopoietic stem cells (PTP HSC). The invention 
specifically concerns native murine and human PTP HSCs, their analogs in other mammals, and their functional derivatives. The invention 
further relates to nucleic acid encoding these proteins, vectors containing and capable of expressing such nucleic acid, and recombinant host 
cells transformed with such nucleic acid. Assays for identifying agonists and antagonists of the native PTP HSCs, methods for expansion 
of undifferentiated stem cells, and methods for the induction of stem cell differentiation are also within the scope of the invention. 
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PROTEIN TYROSINE PHOSPHATASES OF HEMATOPOIETIC CELLS 

Field of the Invention 

The present invention concerns novel protein tyrosine phosphatases. More particularly, the invention 
concerns non-receptor protein tyrosine phosphatases of hematopoietic stem cells (PTP HSC's). 
5 Background of the Invention 

The ability of the hematopoietic stem cell to function as a source of committed progenitors throughout 
the lifetime of the organism is, at present, a poorly understood phenomenon. The major characteristic of the 
hematopoietic stem cell is its ability to self renew in the absence of differentiation (Morrison et at, Ann. Rev. 
Cell Dev. Biol. . J_L, 35-71 [1995]). This self renewal phenomenon is especially remarkable in light of the fact 
10 that the hematopoietic stroma, which is in close physical contact with the stem cell, is known to be a source that 
is rich in factors which mediate the growth and differentiation of hematopoietic progenitors (Deryugina and 
Muller-Sieberg, Crit. Rev, in Immunol. U(2), 115-150 [1993]). For example, a recent PCR analysis of 
hematopoieticaily active endothelial cell stromal lines derived from the murine yolk sac revealed that these cells 
produced a plethora of growth and differentiation factors including stem cell factor, FLT 3 ligand. M-CSF. LIF 
15 and IL-6 (Fennie et a!., Plood 86(12). 4454-4467 [1995]). Such growth factors, in addition to many others, arc 
known to induce the expansion and differentiation of stem cells, and these endothelial cell lines induced a rapid 
expansion and differentiation of embryonic hematopoietic stem cells along the myeloid pathway, although very 
early progenitor cells are also amplified by these stromal cells (C Fennie and L. Lasky-unpublished data). It has 
also been shown that incubation of highly purified stem cell populations in the presence of various purified 

20 hematopoietic growth factors induces differentiation with subsequent loss of the cells' ability to competitively 
repopulate the hematopoietic compartment of lethally irradiated animals, consistent with the induction of terminal 
differentiation (Peters g/a/., B1qq<J &Z(1): 30-37 [1996]). Thus, the stem cell, whether in an embryonic or adult 
stromal environment, must maintain an undifferentiated state in spite of the fact that it is being exposed to a 
variety such maturation factors (Deryugina and Muller-Sieberg, supra). 

25 Although the hematopoietic growth factors are very diverse both structurally and functionally, they are 

all believed to play a role in mediating protein phosphorylation (Paulson and Bernstein, Semin Immunol . 7(4), 
267-77 [1995]). This protein modification can occur via direct means, such as in the cases of the stem cell factor 
and FLT-3 receptors, both of which have intrinsic tyrosine kinase activity, or via indirect means, as is the case 
of the hematopoietic/cytokine growth factor receptors for, for example, 1L-3, EPO and TPO. In the case of the 

30 hematopoietic/cytokine growth factor receptors, tyrosine phosphorylation is indirectly accomplished by the 
activation of the JAK kinases, which occurs after growth factor mediated receptor dimerizat ion (Ihle e\ al, Annu. 
Rey. Immunol. 11, 369-398 [1995]). In both cases, diverse complex pathways of protein phosphorylation are 
stimulated upon receptor binding. The intrinsic tyrosine kinase receptors mediate their signals via an elaborate 
series of tyrosine phosphorylation events which ultimately activate the RAS signaling pathway (Fantl et al Ann. 

35 Rev. Biochem. £2, 453-481 [1993]). This pathway eventually leads to the activation of the serine/threonine 
specific MAP kinase pathway which results in transcriptional activation. In contrast to this intricate pathway, 
hematopoietic growth factor- induced receptor dimerization mediates more direct activation events. Thus, the 
stimulation of the JAK kinases by receptor binding leads to the tyrosine phosphorylation and subsequent- 
dimerization of various STAT proteins. These activated STAT proteins than migrate to the nucleus, bind to 

40 STAT responsive sites in the nuclear DNA and induce transcription of differentiation and growth specific iienes 

-I- 



BNSDOCID: <WO 973501 9A1 > 



WO 97/35019 PCT/US97/05278 

Thus, a major effect of the growth factors produced by the hematopoietic stroma is to mediate the activation of 

various cellular pathways by protein phosphorylation. 

The regulation of protein tyrosine phosphorylation is accomplished by a balance between protein 

tyrosine kinases and protein tyrosine phosphatases (PTPs) (Walton and Dixon, Ann. Rev. Biochem. 62 . 1 0 1 - 1 20 

5 [1993]; Sun and Tonks, Trends Biochem. Sci. . J9( I I ), 480-485 1 1994]). All PTPs contain a phosphatase domain 

including a subset of highly conserved amino acids, and a recent crystal structure analysis of PTP I B complcxcd 

with a tyrosine phosphorylated peptide revealed that many of these conserved residues are involved with 

substrate recognition and tyrosine dephosphorylation (Jia et aL, Science 268 (52 1 8). 1754-1758 [1995]). PTPs 

fall into two general categories: receptor type and non-receptor type. The receptor type PTPs have variously 

1 0 sized extracellular domains and, generally, two intracellular phosphatase domains Walton and Doxin, supra: Sun 
and Tonks, supra. The extracellular domains often contain a number of motifs that are generally utilized in cell 
adhesion including immunoglobulin domains and flbronectin-like regions . Many of these PTPs appear to 
function as homotypic and heterotypic sensors of the extracellular space, and they have been hypothesized to play 
roles in contact inhibition, cell guidance and other intercellular functions (Brady-Kalnay and Tonks. Curr. Opin. 

15 Cell. Biol. 7(5). 65-657 [1995]). The non-receptor PTPs are generally intracellular enzymes. They have various 
cellular localizations, depending upon the types of domains they contain, and some of the enzymes contain SI 12 
motifs which allow them to interact intimately with phosphotyrosine residues. While many of the non-receptor 
PTPs are in various cytoplasmic locations, a small number of these enzymes are found in the nucleus (Flores ei 
al.. Mol. Cell. Biol. 14(7). 4938-46 [1994]). Many non-receptor PTPs appear to function as both activators as 

20 well as inhibitors of diverse tyrosine phosphorylated proteins. A subset appear to play important roles in 
hematopoiesis. For example, the motheaten mouse, which has a phenotype of lethal myeloid amplification and 
inflammation, has been found to have a mutation in the PTP 1C gene (Schulz et ai. y Cell 23(7), 1445-54 [1993]); 
(McCulloch and Siminovitch, Adv. Exp. Med. Biol. 365 , 145-54 [1994]). In addition, the level of tyrosine 
phosphorylation of the EPO receptor, as well as the level of receptor activation, appears to be in part controlled 

25 by the PTP 1C enzyme as well (Klingmuller et al. y £eJ!S0(5), 729-38 [1995]). However, while these examples, 
as well as others, highlight the potential importance of the PTPs, very little is known regarding the physiological 
importance of these enzymes. 

Summary of the Invention 

We have hypothesized that one mechanism by which the undifferentiated state of the stem ce;. might 
30 be maintained is by the dephosphorylation of tyrosine phosphorylated proteins by PTPs. In order to examine 
this possibility, we have analyzed a large number cf PTPs from a very primitive embryonic hematopoietic cell 
population using consensus PCR. From this population we have cloned a novel intracellular PTP which has 
many of the characteristics, including down-regulation of the transcript as the hematopoietic stem cells 
differentiate, which might be expected from a PTP involved with the control of differentiation signals such as 
35 those induced by hematopoetic growth factors. We have designated this novel PTP as the "PTP of hematopoietic 
stem cells", which will be referred to hereafter as "PTP HSC." 

Accordingly, the present invention concerns an isolated non-receptor protein tyrosine phosphatase of 
hematopoietic stem cells (PTP HSC), which 

(I) is expressed predominantly in early hematopoietic stem/progenitor cells; 
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(2) predominantly lacks expression in adult tissues; 

(3) comprises an N-tcrminal tyrosine phosphatase domain, followed by a region rich in serine, 
threonine, and proline, and a carboxy terminal region of about 15 to 25 amino acids rich in basic amino acid 
residues; and 

5 (4) is capable of tyrosine dephosphorylation in hematopoietic stem cells or progenitor cells. 

This novel PTP preferably downregulates STAT activation. A preferred group of the PTP HSC proteins 
of the present invention includes a protein comprising the amino acid sequence shown in Figure 1 (SEQ. ID, 
NO:2); a protein comprising the amino acid sequence shown in Figure 8 (SEQ. ID. NO: 17), a further 
mammalian homologue of either protein; and derivatives of the foregoing proteins retaining the ability of 
10 tyrosine dephosphorylation in hematopoietic stem cells or progenitor cells. 

The PTP HSCs, including derivatives (e.g. amino acid sequence variants) of the native proteins, 
preferably have an active N-tcrminal tyrosine phosphatase domain, retaining a serine residue at a position 
corresponding to amino acid position 37 in Figure 1, and retaining an active site cysteine residue at a position 
corresponding to amino acid position 229 in Figure I, a region rich in serine, threonine, and proline, and a 
15 carboxy-terminal region showing at least about 80% sequence homology with the amino acid sequence between 
positions 430 and 45 1 in Figure 1 . Most preferably, such derivatives have at least about 65% overall sequence 
homology with the amino acid sequence shown in Figure 1 or Figure 8 and retain the ability of tyrosine 
dephosphorylation in hematopoietic stem ceils or progenitor cells. 

In another aspect, the present invention concerns agonists and antagonists of PTP HSCs. a* 
20 In yet another aspect, the invention concerns isolated nucleic acid molecules encoding the PTP HSCs 

herein. 

In a further aspect, the invention concerns vectors comprising nucleic acid encoding the PTP HSCs 
herein, operably linked to control sequences recognized by a host cell transformed with the vector, and to cells 
transformed with such vectors. ^ 

25 In a still further aspect of the present invention, there are provided antibodies capable of specific 

binding to the PTP HSCs of this invention, and hybridoma cell lines producing such antibodies. The antibodies 
may be agonist antibodies, which stimulate the ability of the native PTP HSCs of the present invention to 
dephosphorylate tyrosines, or antagonist antibodies, which block this activity. 

The present invention further concerns an assay for identifying an antagonist or an agonist of a PTP 

30 HSC of the present invention, which comprises contacting the phosphatase domain of the PTP HSC with a 
candidate antagonist or agonist, and monitoring the ability of the phosphatase domain to dephosphorylate 
tyrosine residues. 

In another embodiment, the invention concerns an assay for identifying an antagonist or agonist of a 
PTP HSC of the present invention by cultivating a PTP HSC-expressing hematopoietic stem cell line or 
35 progenitor cell line in the presence of a candidate antagonist or agonist, and monitoring the differentiation of the 
progenitor cells. 

The invention further concerns a method for the differentiation of undifferentiated malignant 
hemopoietic (e.g. leukemia) cells, comprising contacting said cells with an antagonist of a PTP HSC of the 
present invention. 
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In an additional aspect, the invention concerns a method for the induction of hematopoietic stem cell 

differentiation, comprising contacting said stem cells with an antagonist of a PTP HSC of the present invention. 

In another aspect, the invention concerns a method for expansion undifferentiated hematopoietic stems 

cells in cell culture, comprising cultivating stem cells in the presence of a PTP HSC of the present invention or 

5 an agonist antibody specifically binding a native PTP HSC. 

In yet another aspect, the invention concerns a method for the expansion of undifferentiated stem cells 

in vivo comprising administering to a patient an agonist of PTP HSC of the present invention or an agonist 

antibody specifically binding a native PTP HSC, and a stem cell growth factor. 

Brief Description of the Drawings 

10 Figure 1. DNA and deduced protein sequence of the murine PTP HSC cDNA. illustrated is the 

DNA sequence (SEQ. ID. NO: 1) and deduced protein sequence (SEQ. FD. NO: 2) of the murine PTP HSC 
cDNA. The overlined region is the phosphatase homologous domain. The asterisk denotes the active site 
cysteine residue. The P,S,T-rich region is illustrated by boxes around these residues. The shaded carboxy 
terminal region is homologous to a nuclear localization signal found on murine PTP PEP (Flores et al.. Mol. Cell. 

15 BioL 14m. 4938-46 [1994]). 

Figure 2. Sequence homologies of murine PTP HSC, murine PTP PEP, and human PTP PEST, 
A. The phosphatase domain homologies show that these three proteins are highly related to each other. A star 
over the residue (amino acid 37 of PTP HSC) illustrates a conserved serine that is phosphorylated by protein 
kinases A and C and which appears to negatively regulate PTPase activity (Garton and Tonks, EMBQ J. _L3( 16), 

20 3763-71 [1994]). The amino acid sequence of positions 24 - 301 of PTP PEP is shown in SEQ ID. NO: 18; the 
amino acid sequence of positions 24 - 299 of PTP PEST is shown in SEQ. ID. NO: 19. B. A second highly 
homologous region is found at the carboxy terminus of these three proteins (SEQ. ID. NO: 22 showing amino 
acids 783 - 803 of PTP PEP; SEQ. ID. NO: 23 showing amino acids 76 1 - 781 of PTP PEST). This region has 
been shown to confer nuclear localization on PTP PEP. Interestingly PTP PEST is localized to the cytoplasm. 

25 and it has been hypothesized that this is due to the two negatively charged residues shown by the arrows. As can 
be seen, PTP HSC also contains these negatively charged residues, suggesting that it is also localized to the 
cytoplasm. 

Figure 3. The PTP PST family. Illustrated are the three so far identified members of this family 
including the currently described novel PTP (PTP HSC). Shown are the amino terminal PTP domains (black), 
30 the P,S,T rich domains, and the carboxy terminal nuclear localization homology (shaded). 

Figure 4. Intron sites superimposed or ***e PTP HSC domain structure. Analysis of the gene 
encoding PTP HSC revealed the location of 14 intruns that are shown as triangles in this figure. 

Figure 5. In vitro tyrosine phosphatase activity of the PTP HSC. Shown is the enzymatic activity 
obtained using isolated, bacterially produced GST-phosphatase domain of PTP HSC. Black squares, serial 
35 dilutions of GST-PTP HSC in the absence of orthovanadate; white squares, enzymatic activity of GST-P TP HSC 
in the presence of vanadate; closed circle, enzymatic activity of GST alone; open circles enzymatic activity with 
an inactive GST-PTP (J. Cheng and L. Lasky-unpublished data). The initial undiluted reaction contained 2 ug 
of each protein. 
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Figure 6. PCR an*lysis of PTP HSC expression. A. lin lo CD34 hi sca hi or lin lo CD34 hi sca'° 
hematopoietic progenitor cells were isolated from murine embryos at day 1 1 of development. RNA was isolated 
and analyzed by quantitative PCR. The upper band corresponds to the PTP HSC transcript while the lower band 
corresponds to the triose phosphate isomerase (TPI) internal standard. B. lin'°CD34 ni sca ni hematopoietic 
5 progenitor/stem cells were purified from murine fetal liver and incubated for up to 14 days in IL-s, IL-s, EPO 
and GM-CSF. RNA was isolated at various times and analyzed by quantitative PCR as described in A. 

Figure 7. PTP HSC Transcript analysis in embryonic and adult tissues and hematopoietic cell 
lines. A. Illustrated is a tissue northern blot probed with a cDNA encoding PTP HSC. The left panel illustrates Z' 
RNA isolated from variously aged embryos, while the right panel illustrates RNA isolated from: a. heart, b., 
10 brain, c. spleen, d. lung, e. liver, f. skeletal muscle, g. kidney, h. testis. B. Illustrated is a northern blot of RNA 
isolated from BAF 3 (a), 32D (b) and FDCP (c) hematopoietic progenitor cells. Also shown is the ethidium 
bromide stain of the same gel prior to transfer. C. PCR analysis of RNA isolated from BAF 3 (a), 32 D (b), T 
cell clone (c), FDCP (d), 1 1 day embryos (e) and a control with no reverse transcriptase (0 *** 
Figure 8. Partial DNA and deduced protein sequence of the human PTP HSC cDNA. Illustrated 
1 5 is the partial DNA sequence (SEQ. ID. NO: 1 7 ) and deduced protein sequence (SEQ. ID. NO: 1 8) of the human %?■■ 
PTPHSCcDNA. 

Petaikt} Description Qf the Invenfjpn 

A. Definitions 

The phrases "non-receptor protein tyrosine phosphatase of hematopoietic stem cells", "tyrosine ^ v 
20 phosphatase of hematopoietic stem cells'* and "PTP HSC** are used interchangeably and refer to a native 
intracellular protein tyrosine phosphatase which (1) is expressed predominantly in early hematopoietic stem and 
progenitor cells; (2) predominantly lacks expression in adult tissues; (3) comprises an N-terminal 

tyrosine phosphatase domain, followed by a region rich in serine, threonine, and proline, and a carboxy terminal . 
region of about 15 to 25 amino acids rich in basic amino acid residues; and (4) is capable of tyrosine ~ v 
25 dephosphorylation in hematopoietic progenitor cells, and functional derivatives of such native tyrosine *■ 
phosphatase. 

The term "native tyrosine phosphatase 11 in this context refers to a naturally occurring tyrosine 
phosphatase, having the described properties, of any human or non-human animal species, with or without the 
initiating methionine, whether purified from native source, synthesized, produced by recombinant DNA 

30 technology or by any combination of these and/or other methods. Native PTP HSCs specifically include the 
native murine and native human HSC proteins (SEQ. ID. NOs: 2 and , respectively). 

A "functional derivative" of a polypeptide is a compound having a qualitative biological activity in 
common with the native polypeptide. Thus, a functional derivative of a native PTP HSC polypeptide is a 
compound that has a qualitative biological activity in common with a native PTP HSC. "Functional derivatives" 

35 include, but are not limited to, fragments of native polypeptides from any animal species (including humans), 
derivatives of native (human and non-human) polypeptides and their fragments, and peptide and non-peptide 
analogs of native polypeptides, provided that they have a biological activity in common with a respective native 
polypeptide. "Fragments" comprise regions within the sequence of a mature native polypeptide. The term 
"derivative" is used to define amino acid sequence variants, and covalent modifications of a native polypeptide. 



-5- 
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"Non-peptide analogs" arc organic compounds which display substantially the same surface as peptide analogs 

of the native polypeptides. Thus, the non-peptide analogs of the native PTP HSCs of the present invention are 

organic compounds which display substantially the same surface as peptide analogs of the native PTP HSCs 

Such compounds interact with other molecules in a similar fashion as the peptide analogs, and mimic a biological 

5 activity of a native PTP HSC of the present invention. The polypeptide functional derivatives of the native PTP 

HSCs of the present invention preferably have an active N-terminal tyrosine phosphatase domain, retaining a 

serine residue at a position corresponding to amino acid position 37 in Figure I, and retaining an active site 

cysteine residue at a position corresponding to amino acid position 229 in Figure I; a region rich in serine. 

threonine, and proline; and a carboxy- terminal region showing at least about 80% sequence homology with the 

1 0 amino acid sequence between positions 430 and 45 1 in Figure 1 . Preferably, such derivatives have at least about 

65%. more preferably at least about 75 %, even more preferably at least about 85%, most preferably at least 

about 95% overall sequence homology with the amino acid sequence shown in Figure 1 (SRQ. ID. NO: 2) or 

Figure 8 (SEQ. ID. NO: 18) and retain the ability of tyrosine dephosphorylation in hematopoietic progenitor 

cells. 

1 5 The term "biological activity" in the context of the definition of functional derivatives is defined as the 

possession of at least one adhesive, regulatory or effector function qualitatively in common with a native 
polypeptide (e.g. PTP HSC). The functional derivatives of the native PTP HSCs of the present invention art- 
unified by their qualitative ability of tyrosine dephosphorylation in hematopoietic progenitor cells. In addition, 
the functional derivatives of the native PTP HSCs herein preferably are capable of downregulating STAT 

20 activation. 

The term "agonist" is used to refer to peptide and non-peptide analogs of the native PTP HSCs of the 
present invention and to antibodies specifically binding such native PTP HSCs provided that they retain the 
qualitative ability of tyrosine dephosphorylation in hematopoietic progenitor cells. 

The term "antagonist" is used to refer to a molecule inhibiting the ability of a PTP HSC of the present 
25 invention to dephosphorylate tyrosines. Preferred antagonists essentially completely block tyrosine 
dephosphorylation caused by a PTP HSC. 

"Identity" or "homology" with respect to a native polypeptide and its functional derivative is defined 
herein as the percentage of amino acid residues in the candidate sequence that are identical with the residues of 
a corresponding native polypeptide, after aligning the sequences and introducing gaps, if necessary, to achieve 
30 the maximum percent homology, and not considering any conservative substitutions as part of the sequence 
identity. Neither N- or C-terminal extensions nor insertions shall be construed as reducing identity or homology. 
Methods and computer programs for the alignment are well known in the art. 

The term "stem cell" is used in the broadest sense to describe cells which arc not terminally 
differentiated and have the ability to divide throughout the lifetime of the organism, yielding some progeny thai 
35 differentiate and others that remain stem cells, including stem cells of any tissue type, such as the lining of the 
gut. the epidermal layer of the skin and the blood-forming tissues. 

The term "hematopoietic stem cell" is used in the broadest sense to refer to stem cells from which blood 
cells derive, including pluripotent stem cells, lymphoid and myeloid stem cells. 
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The term "hematopoietic progenitor cell" refers to the progeny of a pluripotent hematopoietic stem cell 

which are committed for a particular line of differentiation. These committed progenitor cells are irreversibly 

determined as ancestors of only one or a few blood cell types, e.g. erythrocytes or granulocytes. 

"Hematopoietic growth factors" are growth factors that influence blood cell formation or differentiation 
5 in vw>, such as EPO, TPO, IL-3, IL-6, stem cell growth factor, M-CSF, G-CSF, GM-CSF, FTL 3 figand, LIF, 
etc., unified by their role in mediating protein phosphorylation. The receptors of these growth factors arc either 
transmembrane tyrosine kinases or are members of the cytokine receptor family. 

Ordinarily, the terms "amino acid" and "amino acids" refer to all naturally occurring L-a-amino acids. 
In some embodiments, however, D-amino acids may be present in the polypeptides or peptides of the present 
10 invention in order to facilitate conformational restriction. For example, in order to facilitate disulfide bond 
formation and stability, a D amino acid cysteine may be provided at one or both termini of a peptide functional 
derivative or peptide antagonist of the native PTP HSC's of the present invention. The amino acids are identified 
by either the single-letter or three-letter designations: 



15 



20 



Asp 


D 


aspartic acid 


lie 


I 


isoleucine 


Thr 


T 


threonine 


Leu 


L 


leucine 


Ser 


S 


serine 


Tyr 


Y 


tyrosine 


GIu 


E 


glutamic acid 


Phe 


F 


phenylalanine 


Pro 


P 


proline 


His 


H 


histidine 


Gly 


G 


glycine 


Lys 


K 


lysine 


Ala 


A 


alanine 


Arg 


R 


arginine 


Cys 


C 


cysteine 


Trp 


W 


tryptophan 


Val 


V 


valine 


Gin 


Q 


glutamine 


Met 


M 


methionine 


Asn 


N 


asparagine 



These amino acids may be classified according to the chemical composition and properties of their side 
25 chains. They are broadly classified into two groups, charged and uncharged. Each of these groups is divided 
into subgroups to classify the amino acids more accurately: 
I. Charged Amino Acids 

Acidic Residues : aspartic acid, glutamic acid 
Basic Residues : lysine, arginine, histidine 
30 II. Uncharged Amino Acids 

Hvdrophilic Residues: serine, threonine, asparagine, glutamine 
Aliphatic Residues : glycine, alanine, valine, leucine, isoleucine 
Non-polar Residues: cysteine, methionine, proline 
Aromatic Residues : phenylalanine, tyrosine, tryptophan 
35 Tne terrn "amino acid sequence variant" refers to molecules with some differences in their amino acid 

sequences as compared to a native amino acid sequence. 
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Substitutional variants are those that have at least one amino acid residue in a native sequence removed 

and a different amino acid inserted in its place at the same position. The substitutions may be single, where only 

one amino acid in the molecule has been substituted, or they may be multiple, where two or more amino acids 

have been substituted in the same molecule. 

5 Insertional variants are those with one or more amino acids inserted immediately adjacent to an amino 

acid at a particular position in a native sequence. Immediately adjacent to an amino acid means connected to 

cither the oc-carboxy or a-amino functional group of the amino acid. 

Deletional variants are those with one or more amino acids in the native amino acid sequence removed. 

Ordinarily, deletional variants will have one or two amino acids deleted in a particular region of the molecule. 

10 "Antibodies (Abs) M and "immunoglobulins (Igs)" are glycoproteins having the same structural 

characteristics. While antibodies exhibit binding specificity to a specific antigen, immunoglobulins include both 
antibodies and other antibody-like molecules which lack antigen specificity. Polypeptides of the latter kind are, 
for example, produced at low levels by the lymph system and at increased levels by myelomas. 

Native antibodies and immunoglobulins are usually heterotetrameric glycoproteins of about 150.000 

! 5 daltons. composed of two identical light (L) chains and two identical heavy (H) chains. Each light chain is linked 
to a heavy chain by one covalent disulfide bond, while the number of disulfide linkages varies between the heavy- 
chains of different immunoglobulin isotypes. Each heavy and light chain also has regularly spaced intrachain 
disulfide bridges. Each heavy chain has at one end a variable domain (V H ) followed by a number of constant 
domains. Each light chain has a variable domain at one and (V^) and a constant domain at its other end; the 

20 constant domain of the light chain is aligned with the first constant domain of the heavy chain, and the light chain 
variable domain is aligned with the variable domain of the heavy chain. Particular amino acid residues are 
believed to form an interface between the light and heavy chain variable domains (Clothia et ai, J. Mol. Biol. 
186 , 651-663 [1985]; Novotny and Haber, Proc. Natl. Acad. Sci. USA 82. 4592-4596 [1985]). 

The term "variable" refers to the fact that certain portions of the variable domains differ extensively in 

25 sequence among antibodies and are used in the binding and specificity of each particular antibody for its 
particular antigen. However, the variability is not evenly distributed through the variable domains of antibodies. 
It is concentrated in three segments called complementarity determining regions (CDRs) or hypervariable regions 
both in the light chain and the heavy chain variable domains. The more highly conserved portions of variable 
domains are called the framework (FR). The variable domains of native heavy and light chains each comprise 

30 four FR regions, largely adopting a P-sheet configuration, connected by three CDRs, which form loops 
connecting, and in some cases forming part of, the P-sheet structure. The CDRs in each chain are held together 
in close proximity by the FR regions and, with the CDRs from the other chain, contribute to the formation of the 
antigen binding site of antibodies (see Kabat, E.A. et ai. Sequences of Proteins of Immunological Interest , 
National Institute of Health, Bethesda, MD [1991]). The constant domains are not involved directly in binding 

35 an antibody to an antigen, but exhibit various effector functions, such as participation of the antibody in antibody- 
dependent cellular toxicity. 

Papain digestion of antibodies produces two identical antigen binding fragments, called Fab fragments, 
each with a single antigen binding site, and a residual "Fc" fragment, whose name reflects its ability to crystallize 
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readily. Pepsin treatment yields an F(ab')-> fragment that has two antigen combining sites and is still capable ol 

cross-linking antigen. 

M Fv" is the minimum antibody fragment which contains a complete antigen recognition and binding site. 
This region consists of a dimer of one heavy and one light chain variable domain in tight, non-covalent 
5 association, it is in this configuration that the three CDRs of each variable domain interact to define an antigen 
binding site on the surface of the V|_j-Vj^ dimcr. Collectively, the six CDRs confer antigen binding specificity 
to the antibody. However, even a single variable domain (or half of an Fv comprising only three CDRs specific 
for an antigen) has the ability to recognize and bind antigen, although at a lower affinity than the entire binding 
site. 

10 The Fab fragment also contains the constant domain of the light chain and the first constant domain 

(CHI ) of the heavy chain. Fab' fragments differ from Fab fragments by the addition of a few residues at the 
carboxy terminus of the heavy chain CHI domain including one or more cysteines from the antibody hinge 
region. Fab'-SH is the designation herein for Fab' in which the cysteine residue(s) of the constant domains bear 
a free thiol group. F(ab')-) antibody fragments originally were produced as pairs of Fab' fragments which have 

I 5 hinge cysteines between them. Other, chemical couplings of antibody fragments are also known. 

The light chains of antibodies (immunoglobulins) from any vertebrate species can be assigned to one 
of two clearly distinct types, called kappa and lambda (k). based on the amino acid sequences of their; constant 
domains. 

Depending on the amino acid sequence of the constant domain of their heavy chains, immunoglobulins 

20 can be assigned to different classes. There are five major classes of immunoglobulins: IgA, lgD, IgE? IgG and 
IgM, and several of these may be further divided into subclasses (isotypes), e.g. IgG- 1 1 lgG-2, lgG-3, and IgG-4; 
IgA- 1 and lgA-2. The heavy chain constant domains that correspond to the different classes of immunoglobulins 
are called a, delta, epsilon, and u,, respectively. The subunit structures and three-dimensional configurations 
of different classes of immunoglobulins are well known. 

25 The term "antibody" is used in the broadest sense and specifically covers single monoclonal antibodies 

(including agonist and antagonist antibodies), antibody compositions with polyepitopic specificity, as well as 
antibody fragments (e.g., Fab, F(ab')2» and Fv), so long as they exhibit the desired biological activity. 

The term "monoclonal antibody" as used herein refers to an antibody obtained from a population of 
substantially homogeneous antibodies, i.e., the individual antibodies comprising the population are identical 

30 except for possible naturally occurring mutations that may be present in minor amounts. Monoclonal antibodies 
arc highly specific, being directed against a single antigenic site. Furthermore, in contrast to conventional 
(polyclonal) antibody preparations which typically include different antibodies directed against different 
determinants (epitopes), each monoclonal antibody is directed against a single determinant on the antigen. In 
addition to their specificity, the monoclonal antibodies are advantageous in that they are synthesized by the 

35 hybridoma culture, uncontaminated by other immunoglobulins. The modifier "monoclonal" indicates the 
character of the antibody as being obtained from a substantially homogeneous population of antibodies, and is 
not to be construed as requiring production of the antibody by any particular method. For example, the 
monoclonal antibodies to be used in accordance with the present invention may be made by the hybridoma 
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method first described by Kohler &. Milstein, Nature 256 :495 ( 1 975), or may be made by recombinant DN A 

methods [see, e.g. U.S. Patent No. 4,8 16.567 (Cabilly et al)\ 

The monoclonal antibodies herein specifically include "chimeric 0 antibodies (immunoglobulins) in 

which a portion of the heavy and/or light chain is identical with or homologous to corresponding sequences in 

5 antibodies derived from a particular species or belonging to a particular antibody class or subclass, while the 

remainder of the chain(s) is identical with or homologous to corresponding sequences in antibodies derived from 

another species or belonging to another antibody class or subclass, as well as fragments of such antibodies, so 

long as they exhibit the desired biological activity (U.S. Patent No. 4,8 1 6,567 (Cabilly et aL ; Morrison et al , 

Proc. Natl. Acad. Sci. USA 81. 6851-6855 [1984]). 

10 "Humanized" forms of non-human (e.g. murine) antibodies are chimeric immunoglobulins, 

immunoglobulin chains or fragments thereof (such as Fv, Fab, Fab', F(ab*)-> or other antigen-binding 
subsequences of antibodies) which contain minimal sequence derived from non-human immunoglobulin. For 
the most part, humanized antibodies are human immunoglobulins (recipient antibody) in which residues from 
a complementary determining region (CDR) of the recipient are replaced by residues from a CDR of a non- 

1 5 human species (donor antibody) such as mouse, rat or rabbit having the desired specificity, affinity and capacity. 
In some instances. Fv framework residues of the human immunoglobulin are replaced by corresponding non- 
human residues. Furthermore, humanized antibody may comprise residues which are found neither in the 
recipient antibody nor in the imported CDR or framework sequences. These modifications are made to further 
refine and optimize antibody performance. In general, the humanized antibody will comprise substantially all 

20 of at least one, and typically two, variable domains, in which all or substantially all of the CDR regions 
correspond to those of a non-human immunoglobulin and all or substantially all of the FR regions are those of 
a human immunoglobulin consensus sequence. The humanized antibody optimally also will comprise at least 
a portion of an immunoglobulin constant region (Fc), typically that of a human immunoglobulin. For further 
details see: Jones et aL. Nature 321. 522-525 [1986]; Reichmann et al. Nature 332 . 323-329 [1988]; EP-B-239 

25 400 published 30 September 1987; Presta, Curr. Op. Struct. Biol. 2 593-596 [1992J: and FP-B-451 216 
published 24 January 1996). 

In the context of the present invention the expressions "cell", "cell line", and "cell culture" are used 
interchangeably, and all such designations include progeny. It is also understood that all progeny may not be 
precisely identical in DNA content, due to deliberate or inadvertent mutations. Mutant progeny that have the 

30 same function or biological property, as screened for in the originally transformed cell, are included. 

The terms "replicable expression vector" and "expression vector" refer to a piece of DNA. usually 
double-stranded, which may have inserted into it a piece of foreign DNA. Foreign DNA is defined as 
heterologous DNA, which is DNA not naturally found in the host cell. The vector is used to transport the foreign 
or heterologous DNA into a suitable host cell. Once in the host cell, the vector can replicate independently of 

35 the host chromosomal DNA, and several copies of the vector and its inserted (foreign) DNA may be generated. 
In addition, the vector contains the necessary elements that permit translating the foreign DNA into a 
polypeptide. Many molecules of the polypeptide encoded by the foreign DNA can thus be rapidly synthesized. 

"Oligonucleotides" are short- length, single- or double-stranded polydeoxynucleotides that are 
chemically synthesized by known methods [such as phosphotricster, phosphite, or phosphoramidite chemistry. 
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using solid phase techniques such as those described in EP 266.032, published 4 May 1988, or via 

deoxynucleoside H-phosphanate intermediates as described by Froehler et a/., Nucl. Acids Res. 14 , 5399 ( 1986). 

They are then purified on polyacrylamide gels. 

B. Production of PTP HSCs by recombinant DNA technology 

5 1. Identification and isolation of nucleic acid encoding PTP HSCs 

The native PTP HSC proteins of the present invention may be isolated from relatively undifferentiated, 

early hematopoietic stem or progenitor cells. The isolation of murine PTP HSC from the CD34 ni fraction of 

murine 10.5 day yolk sac or embryo cells is illustrated in the examples. Similarly, murine PTP HSC can be 

isolated from CD34 population originated from bone marrow or fetal liver. The purity of these murine cells 

10 was found to be a critical step in isolating the mRNA encoding the new murine PTP HSC of the present 

invention. A high degree of purity was achieved by purification with a rabbit anti-murinc CD34 antibody 

followed by a lineage depletion step and a positive selection step with the Sea antibody. Alternatively, murine 

PTP HSC can be detected and obtained from other relatively undifferentiated precursors of mature murine 

hematopoietic cells, such as, BAF 3, 32D and FDCP hematopoietic progenitor cells, available from the American 

1 5 Type Culture Collection (ATCC). Native human PTP HSC can, for example, be identified in and obtained from 
human CMK progenitor cells. As the PTP HSCs enzymes have an extremely low abundance in embryonic 
tissues, their purification by traditional methods would be very cumbersome and inefficient. Instead, cDN A or 
genomic clones encoding the PTP HSC proteins of the present invention can be prepared using standard 
techniques of recombinant DNA technology. For example, cDNA library can be constructed by obtaining 

20 polyadenylated mRNA from a cell line known to express the desired PTP HSC, and using the mRNA as a 
template to synthesize double stranded cDNA. Exemplary human and non-human cell lines suitable for this 
purpose have been listed hereinabove. A PTP HSC polypeptide gene can also be obtained from agenomic 
library, such as a human genomic cosmid library. 

Libraries, either cDNA or genomic, are then screened with probes designed to identify the gene of 

25 interest or the protein encoded by it. For cDNA expression libraries, suitable probes include monoclonal and 
polyclonal antibodies that recognize and specifically bind to a PTP HSC polypeptide. For cDNA libraries, 
suitable probes include carefully selected oligonucleotide probes (usually of about 20-80 bases in length) that 
encode known or suspected portions of a PTP HSC polypeptide from the same or different species, and/or 
complementary or homologous cDNAs or fragments thereof that encode the same or a similar gene. Appropriate 

30 probes for screening genomic DNA libraries include, without limitation, oligonucleotides, cDNAs, or fragments 
thereof that encode the same or a similar gene, and/or homologous genomic DNAs or fragments thereof. 
Screening the cDNA or genomic library with the selected probe may be conducted using standard procedures 
as described in Chapters 10-12 of Sambrook et ai y Molecular Cloning: A Laboratory Manual . New York, Cold 
Spring Harbor Laboratory Press. 1989. 

35 If DNA encoding an enzyme of the present invention is isolated by using carefully selected 

oligonucleotide sequences to screen cDNA libraries from various tissues, the oligonucleotide sequences selected 
as probes should be sufficient in length and sufficiently unambiguous that false positives are minimized. The 
actual nucleotide sequence(s) is/arc usually designed based on regions which have the least codon redundance. 



-II- 

BNSOOCID: <WO 973501 9A1 > 



WO 97/35019 PCT/US97/05278 

The oligonucleotides may be degenerate at one or more positions. The use of degenerate oligonucleotides is of 

particular Importance where a library is screened from a species in which preferential codon usage is not known. 

The oligonucleotide must be labeled such that it can be detected upon hybridization to DNA in the 

library being screened. The preferred method of labeling is to use ATP (e.g., y p ) and polynucleotide kinase 

5 to radiolabel the 5' end of the oligonucleotide. However, other methods may be used to label the oligonucleotide, 

including, but not limited to, biotinylation or enzyme labeling. 

cDNAs encoding PTP HSCs can also be identified and isolated by other known techniques of 

recombinant DNA technology, such as by direct expression cloning, or by using the polymerase chain reaction 

(PCR) as described in U.S. Patent No. 4,683,195, issued 28 July 1987, in section 14 of Sambrook et at., supra, 

10 or in Chapter 1 5 of Current Protocols in Molecular Biology . Ausubel et al eds., Greene Publishing Associates 

and Wiley-lnterscience 1991 . The use of the PCR technique for obtaining cDNA encoding murine PTP I ISC 

or the PTP domain of this native protein is also illustrated in the examples. 

Once cDNA encoding a PTP HSC enzyme from one species has been isolated, cDNAs from other 

species can also be obtained by cross-species hybridization. According to this approach, human or other 

1 5 mammalian cDNA or genomic libraries are probed by labeled oligonucleotide sequences selected from known 

PTP HSC sequences (such as murine PTP HSC) in accord with known criteria, among which is that the sequence 

should be sufficient in length and sufficiently unambiguous that false positives are minimized. Typically, a ^P- 

labeled oligonucleotide having about 30 to 50 bases is sufficient, particularly if the oligonucleotide contains one 

or more codons for methionine or tryptophan. Isolated nucleic acid will be DNA that is identified and separated 

20 from contaminant nucleic acid encoding other polypeptides from the source of nucleic acid. Hybridization is 

preferably performed under "stringent conditions" which means (I) employing low ionic strength and hgh 

temperature for washing, for example, 0.015 sodium chloride/0.0015 M sodium citrate/0.1% sodium dodecyl 

sulfate at 50 °C, or (2) employing during hybridization a denaturing agent, such as formamide, for example, 50% 

(vol/vol) formamide with 0.1% bovine serum albumin/0.1% polyvinylpyrrolidone/50 nM sodium phosphate 

25 buffer at pH 6.5 with 650 mM sodium chloride, 75 mM sodium citrate at 42 °C. Another example is the use of 

5)% formamide, 5 x SSC (0.75 M sodium chloride, 0.075 M sodium citrate), 50 mM sodium phosphate (pH 6.8), 

0.1% sodium pyrophosphate, 5 x Derthardt's solution, sonicated salmon sperm DNA (50 ug/ml), 0.1% SDS, and 

10%dextran sulfate at 42 °C, with washes at 42 °C in 0.2 x SSC and 0.1% SDS. 

Once the sequence is known, the gene encoding a particular PTP HSC polypeptide can also be obtained 

30 by chemical synthesis, following one of the methods described in Engels and Uhlmann, Agnew. Chem. Int. Ed. 

Engl. 28 . 716 (1989). These methods include triester, phosphite, phosphoramidite and H-phosphonate methods, 

PCR and other autoprimer methods, and oligonucleotide syntheses on solid supports. 

2. Cloning and expression of nucleic acid encoding PTP HSCs 

Once the nucleic acid encoding PTP HSC is available, it is generally ligated into a replicable expression 

35 vector for further cloning (amplification of the DNA), or for expression. 

Expression and cloning vectors are well known in the art and contain a nucleic acid sequence that 

enables the vector to replicate in one or more selected host cells. The selection of the appropriate vector will 

depend on 1 ) whether it is to be used for DNA amplification or for DNA expression, 2) the size of the DNA to 

be inserted into the vector, and 3) the host cell to be transformed with the vector. Each vector contains various 
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components depending on its function (amplification of DNA of expression of DNA) and the host cell for which 

it is compatible. The vector components generally include, but are not limited to. one or more of the following. 

a signal sequence, an origin of replication, one or more marker genes, an enhancer element, a promoter, and a 

transcription termination sequence. Construction of suitable vectors containing one or more of the above listed 

5 components, the desired coding and control sequences, employs standard ligation techniques. Isolated plasm ids 

or DNA fragments are cleaved, tailored, and religated in the form desired to generate the plasmids required. For 

analysis to confirm correct sequences in plasmids constructed, the ligation mixtures are commonly used to 

transform E. coli cells, e.g. E. coli K12 strain 294 (ATCC 3 1,446) and successful transformants selected by 

ampicillin or tetracycline resistance where appropriate. Plasmids from the transformants are prepared, analyzed 

1 0 by restriction endonuc lease digestion, and/or sequenced by the method of Messing et ai , Nucleic Acids Res. 9, 
309 ( 1 98 1 ) or by the method of Maxam et ai , Methods in Enzvmologv 65, 499 ( 1 980). 

The polypeptides of the present invention may be expressed in a variety of prokaryotic and eukaryotic 
host cells. Suitable prokaryotes include gram negative or gram positive organisms, for example E. coli or bacilli. 
A preferred cloning host is E. coli 294 (ATCC 31,446) although other gram negative or gram positive 

15 prokaryotes such as E. coJi B, £. £oJj X1776 (ATCC 31,537), E. coh W31 10 (ATCC 27,325), Pseudomonas 
species, or Serratia Marcesans are suitable. 

In addition to prokaryotes, eukaryotic microbes such as filamentous fungi or yeast are suitable hosts for 
vectors herein. Saccharomvces cerevisiae . or common baker's yeast, is the most commonly used among lower 
eukaryotic host microorganisms. However, a number of other genera, species and strains are commonly.available 

20 and useful herein, such as S. pom be [Beach and Nurse, Nature 290 . 140 (1981)], Kluvveromvces lactis 
[Louvencourt et ai, J. Bacteriol. 737 ( 1 983)]; varrowia (EP 402,226); Pichia pastoris (EP 1 83,070), Trichoderma 
reesia (EP 244.234). Neurospora crassa (Case et ai. Proc. Natl. Acad. Sci. USA 76. 5259-5263 (1979)]; and 
Aspergillus hosts such as A. nidulans [Ballance et ai, Biochem. Biophvs. Res. Commun. 1 12 . 284-289 ( 1 983); 
Tilburn et ai. Gene 26. 205-22 1 (1983): Yelton etai. Proc. Natl. Acad. Sci. USA 81. 1470-1474 (1984)] and 

25 A. nig er [Kelly and Hynes, EMBO J. 4, 475-479 ( 1 985)]. 

Suitable host cells may also derive from multicellular organisms. Such host cells are capable of 
complex processing and glycosylation activities. In principle, any higher eukaryotic cell culture is workable, 
whether from vertebrate or invertebrate culture, although cells from mammals such as humans are preferred. 
Examples of invertebrate cells include plants and insect cells. Numerous baculoviral strains and variants and 

30 corresponding permissive insect host cells from hosts such as Spodoptera frueiperda (caterpillar), Aedes aegvpti 
(mosquito), Ae^es alfropictus (mosquito), Drosophila melangaster (fruitfly), and Bombvx mori host cells have 
been identified. See, e.g. Luckow et ai. Bio/Technology 6. 47-55 ( 1 9ftxV Miller etai, in Genetic Engineering . 
Setlow, J.K. et ai, eds., Vol. 8 (Plenum Publishing, 1986), pp. 277-279; and Maeda et ai, Nature 315 , 592-594 
( 1 985). A variety of such viral strains are publicly available, e.g. the L- 1 variant of Autoerapha californica NPV. 

35 and such viruses may be used as the virus herein according to the present invention, particularly for transfection 
of Spodoptera frueiperda cells. 

Plant cell cultures of cotton, com, potato, soybean, petunia, tomato, and tobacco can be utilized as hosts. 
Typically, plant cells are transfected by incubation with certain strains of the bacterium Agrobacterium 
tumefaciens. which has been previously manipulated to contain the PTP HSC DNA. During incubation of the 
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plant cell culture with A. tumefaciens . the DNA encoding a PTP HSC is transferred to the plant cell host such 

that it is transfected. and will, under appropriate conditions, express the PTP HSC DNA. In addition, regulator)' 

and signal sequences compatible with plant cells are available, such as the nopaiine synthase promoter and 

polyadenylation signal sequences. Depickerg/ a/.. J. Mol. Appl. Gen. 1. 561 (1982). In addition. DNA segments 

5 isolated from the upstream region of the T-DNA 780 gene are capable of activating or increasing transcription 

levels of plant-expressible genes in recombinant DNA-containing plant tissue. See EP 32 1, 1 96 published 2 1 

June 1989. 

However, interest has been greatest in vertebrate cells, and propagation of vertebrate cells in culture 
(tissue culture) is per se well known. Sec Tissue Culture . Academic Press, Kruse and Patterson, editors (1973). 

1 0 Examples of useful mammalian host cell lines are monkey kidney CV 1 line transformed by S V40 (COS-7. ATCC 
CRL 1651); human embryonic kidney cell line [293 or 293 cells subcloned for growth in suspension culture, 
Graham et g/. J.Gen. Virol. 36. 59 (1977)]; baby hamster kidney cells 9BHK. ATCC CCL 10); Chinese hamster 
ovary cellsADHFR [CHO, Urlaub and Chasin, Proc. Natl. Acad. ScL USA 77, 42 1 6 ( 1 980)]; mouse sertolli cells 
[TM4, Mather. Biol. Reorod. 23. 243-25 1 (1980)]; monkey kidney cells (CVI ATCC CCL 70); African green 

1 5 monkey kidney cells (VERO-76, ATCC CRL- 1 587); human cervical carcinoma cells (HELA, ATCC CCL 2 ): 
canine kidney cells (MDCK, ATCC CCL 34); buffalo rat liver cells (BRL 3A, ATCC CRL 1442); human lung 
cells (W J38, ATCC CCL75); human liver cells (Hep G2, HB 8065); mouse mammary tumor (MMT 060562. 
ATCC CCL5 1); TRl cells [Mather et a/. t Annals N.Y. Acad. Sci. 383 . 44068 (1982)]; MRC 5 cells; FS4 cells; 
and a human hepatoma cell line (Hep G2). Preferred host cells are human embryonic kidney 293 and Chinese 

20 hamster ovary cells. 

Particularly useful in the practice of this invention are expression vectors that provide for the transient 
expression in mammalian cells of DNA encoding a PTP HSC. In general, transient expression involves the use 
of an expression vector that is able to replicate efficiently in a host cell, such that the host cell accumulates many 
copies of the expression vector and, in turn, synthesizes high levels of a desired polypeptide encoded by the 

25 expression vector. Transient systems, comprising a suitable expression vector and a host cell, allow for the 
convenient positive identification of polypeptides encoded by clones DNAs, as well as for the rapid screening 
of such polypeptides for desired biological or physiological properties. Thus, transient expression systems are 
particularly useful in the invention for purposes of identifying analogs and variants of a PTP HSC. 
Other methods, vectors, and host cells suitable for adaptation to the synthesis of the PTP HSC polypeptides in 

30 recombinant vertebrate cell culture are described in Getting et al. % Nature 293 . 620-625 ( 1 98 1 ); Mantel et aL, 
Nature 281 . 40-46 (1979); Levinson et ai \ EP 1 17,060 and EP I 17,058. Particularly useful plasmids for 
mammalian cell culture expression of the PTP HSC polypeptides are pRK5 (EP 307.247). or pSVI6B (PCT 
Publication No. WO 91/08291). 

Other cloning and expression vectors suitable for the expression of the PTP HSCs of the present 

35 invention in a variety of host cells are, for example, described in EP 457,758 published 27 November 1991 . A 
large variety of expression vectors is now commercially available. An exemplary commercial yeast expression 
vector is pPIC.9 (Invitrogen), while an commercially available expression vector suitable for transformation of 
E. coli cells is PET15b (Novagen). 
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C. Culturtng the Host Cells 

Prokaryotes cells used to produced the PTP HSCs of this invention are cultured in suitable media as 
describe generally in Sambrook et ai y supra . 

Mammalian cells can be cultured in a variety of media. Commercially available media such as Ham's 
5 F10 (Sigma), Minimal Essential Medium (MEM, Sigma), RPM I- 1640 (Sigma), and Dulbecco's Modified Eagle s 
Medium (DMEM, Sigma) are suitable for culturing the host cells. In addition, any of the media described in 
Ham and Wallace, Meth. EnzvmoL 58. 44 (1979); Barnes and Sato, Anal. Biochem. 102 . 255 (1980), US 
4,767,704; 4,657,866; 4,927,762; or 4,560,655; WO 90/03430; WO 87/00195 or US Pat. Rc. 30.985 may be 
used as culture media for the host cells. Any of these media may be supplemented as necessary with hormones 
10 and/or other growth factors (such as insulin, transferrin, or epidermal growth factor), salts (such as sodium 
chloride, calcium, magnesium, and phosphate), buffers (such as HEPES), nucleosides (such as adenosine and 
thymidine), antibiotics (such as Gentamycin™ drug) trace elements (defined as inorganic compounds usually 
present at final concentrations in the micromolar range), and glucose or an equivalent energy source. Any other 
necessary supplements may also be included at appropriate concentrations that would be known to those skilled 
1 5 in the art. The culture conditions, such as temperature, pH and the like, suitably are those previously used with 
the host cell selected for cloning or expression, as the case may be. and will be apparent to the ordinary artisan. 

The host cells referred to in this disclosure encompass cells in in vitro cell culture as well as cells that 
are within a host animal or plant. z 

It is further envisioned that the PTP HSCs of this invention may be produced by homologous 
20 recombination, or with recombinant production methods utilizing control elements introduced into cells already 
containing DNA encoding the particular PTP HSC. 

D. Detecting Gene Amplification/Expression 

Gene amplification and/or expression may be measured in a sample directly, for example, by 
conventional Southern blotting, Northern blotting to quantitate the transcription of mRNA [Thomas, Proc. Natl. 

25 A93d. S<?i, USA 22, 5201-5205 (1980)], dot blotting (DNA analysis), or m situ hybridization, using an 
appropriately labeled probe, based on the sequences provided herein. Various labels may be employed, most 
commonly radioisotopes, particularly Jji P. However, other techniques may also be employed, such as using 
biotin-modifled nucleotides for introduction into a polynucleotide. The biotin then serves as a site for binding 
to avidin or antibodies, which may be labeled with a wide variety of labels, such as radionuclides, fluoresces, 

30 enzymes, or the like. Alternatively, antibodies may be employed that can recognize specific duplexes, including 
DNA duplexes, RNA duplexes, and DNA-RNA hybrid duplexes or DNA-protein duplexes. The antibodies in 
turn may be labeled and the assay may be carried out where the duplex is bound to the surface, so that upon the 
formation of duplex on the surface, the presence of antibody bound to the duplex can be detected. 

Gene expression, alternatively, may be measured by immunological methods, such as 

35 immunohistochemical staining of tissue sections and assay of cell culture or body fluids, to quantitate directly 
the expression of gene product. With immunohistochemical staining techniques, a cell sample is prepared, 
typically by dehydration and fixation, followed by reaction with labeled antibodies specific for the gene produci 
coupled, where the labels are usually visually detectable, such as enzymatic labels, fluorescent labels. 
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luminescent labels, and the like. A particularly sensitive staining technique suitable for use in the present 

invention is described by Hse ei ai , Am. J. Clin. Pharm. 7£, 734-738 ( 1 980). 

Antibodies useful for immunohistochemical staining and/or assay of sample fluids may be either 
monoclonal or polyclonal, and may be prepared in any animal. Conveniently, the antibodies may be prepared 
5 against a native PTP HSC polypeptide, or against a synthetic peptide based on the DNA sequence provided 
herein as described further hereinbelow. 

E. Amino Acid Sequence Variants of a native PTP HSCs 

Amino acid sequence variants of native PTP HSCs are prepared by methods known in the art by 
introducing appropriate nucleotide changes into a PTP HSC DNA, or by in vitro synthesis of the desired 

10 polypeptide. There are two principal variables in the construction of amino acid sequence variants: the location 
of the mutation site and the nature of the mutation. With the exception of naturally-occurring alleles, which do 
not require the manipulation of the DNA sequence encoding the PTP HSC, the amino acid sequence variants of 
PTP HSCs are preferably constructed by mutating the DNA, either to arrive at an allele or an amino acid 
sequence variant that does not occur in nature. 

1 5 One group of the mutations will be created within the phosphatase (PTP) domain of the enzymes of 

the present invention. Non-conservative substitutions within this domain may result in PTP HSC variants which 
loose their ability to dephosphatase tyrosines and will, therefore, be useful as antagonists of native PTP HSCs. 
PTP HSC variants mutated to enhance their enzymatic activity will be useful, for example, as more effective 
inhibitors of progenitor/stem cell differentiation. 

20 Alternatively or in addition, amino acid alterations can be made at sites that differ in PTP HSC proteins 

from various species, or in highly conserved regions, depending on the goal to be achieved. Sites at such 
locations will typically be modified in series, e.g. by ( 1 ) substituting first with conservative choices and then with 
more radical selections depending upon the results achieved, (2) deleting the target residue or residues, or (3) 
inserting residues of the same or different class adjacent to the located site, or combinations of options 1-3. One 

25 helpful technique is called "alanine scanning" (Cunningham and Wells. Science 244 . 1081-1085 [1989]). 

After identifying the desired mutation(s), the gene encoding a PTP HSC variant can, for example, be 
obtained by chemical synthesis as hereinabove described. More preferably, DNA encoding a PTP HSC amino 
acid sequence variant is prepared by site-directed mutagenesis of DNA that encodes an earlier prepared variant 
or a nonvariant version of the PTP HSC. Site-directed (site-specific) mutagenesis allows the production of PTP 

30 HSC variants through the use of specific oligonucleotide sequences that encode the DNA sequence of the desired 
mutation, as well as a sufficient number of adjacent nucleotides, to provide a primer sequence of sufficient size 
and sequence complexity to form a stable duplex on both sides of the deletion junction being traversed. 
Typically, a primer of about 20 to 25 nucleotides in length is preferred, with about 5 to 10 residues on both sides 
of the junction of the sequence being altered. In general, the techniques of site-specific mutagenesis arc well 

35 known in the art, as exemplified by publications such as. Edelman ei a/„ DNA 2> 183 (1983). As will be 
appreciated, the site-specific mutagenesis technique typically employs a phage vector that exists in both a single- 
stranded and double-stranded form. Typical vectors useful in site-directed mutagenesis include vectors such as 
the M 1 3 phage, for example, as disclosed by Messing ei ai , Third Cleveland Symposium on Macromolecules 
and Recombinant DNA , A. Walton, ed., Klsevier, Amsterdam (1981). This and other phage vectors are 
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commercially available and their use is well known to those skilled in the art. A versatile and efficient procedure 

for the construction of oligodeqxyribonucleotide directed site-specific mutations in DNA fragments using M 1 3- 

derived vectors was published by Zolier, M.J. and Smith. M., Nucleic Acids Res. K). 6487-6500 f 1982J). Also. 

plasmid vectors that contain a single-stranded phage origin of replication (Veira et ai, Mcth. Enzvmol. 153. 3 

5 [1987]) may be employed to obtain single-stranded DNA. Alternatively, nucleotide substitutions areintroduced 

by synthesizing the appropriate DNA fragment in vitro, and amplifying it by PCR procedures known in the an. 

The PCR technique may also be used in creating amino acid sequence variants of a PTP HSC. In a 

specific example of PCR mutagenesis, template plasmid DNA (I ug) is linearized by digestion with a restriction 

endonuclease that has a unique recognition site in the plasmid DNA outside of the region to be amplified. Of 

1 0 this material, 1 00 ng is added to a PCR mixture containing PCR buffer, which contains the four deoxynucleotidc 
triphosphates and is included in the GeneAmp R kits (obtained from Perkin-Elmcr Cetus, Norwalk, CT and 
Emeryville, CA), and 25 pmole of each oligonucleotide primer, to a final volume of 50 ul. The reaction mixture 
is overlayered with 35 ul mineral oil. The reaction is denatured for 5 minutes at 100oC, placed briefly on ice, 
and then 1 ul Thermus aquaticus (lag) DNA polymerase (5 units/ I), purchased from Perkin-Elmer Cetus, 

1 5 Norwalk, CT and Emeryville, CA) is added below the mineral oil layer. The reaction mixture is then inserted 
into a DNA Thermal Cycler (purchased from Perkin-Elmer Cetus) programmed as follows: 
2 min. 55oC, 

30 sec. 72oC, then 19 cycles of the following: ~ 
30 sec. 94oC, . : 
20 30 sec. 55°C, and 

30sec.,72oC. 

At the end of the program, the reaction vial is removed from the thermal cycler and the aqueous phase 
transferred to a new vial, extracted with phenol/chloroform (50:50 vol), and ethanol precipitated, and the DNA 
is recovered by standard procedures. This material is subsequently subjected to appropriate treatments for 

25 insertion into a vector. 

Another method for preparing variants, cassette mutagenesis, is based on the technique described by 
Wells etaL f Gene 34. 315 (1985^. 

Additionally, the so-called phagemid display method may be useful in making amino acid sequence 
variants of native or variant PTP HSCs or their fragments. This method involves (a) constructing a replicable 

30 expression vector comprising a first gene encoding an receptor to be mutated, a second gene encoding at least 
a portion of a natural or wild-type phage coat protein wherein the first and second genes are heterologous, and 
a transcription regulatory element operably linked to the first and second genes, thereby forming a gene fusion 
encoding a fusion protein; (b) mutating the vector at one or more selected positions within the first gene thereby 
forming a family of related plasmids; (c) transforming suitable host cells with the plasmids; (d) infecting the 

35 transformed host cells with a helper phage having a gene encoding the phage coat protein; (e) culturing the 
transformed infected host cells under conditions suitable for forming recombinant phagemid panicles containing 
at least a portion of the plasmid and capable of transforming the host, the conditions adjusted so that no more 
than a minor amount of phagemid particles display more than one copy of the fusion protein on the surface of 
the particle; (0 contacting the phagemid particles with a suitable antigen so that at least a portion of the phagemid 
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particles bind to the antigen; and (g) separating the phagemid particles that bind from those that do not. Steps 

(d) through (g) can be repeated one or more times. Preferably in this method the plasmid is under tight control 

of the transcription regulatory element, and the culturing conditions are adjusted so that the amount or number 

of phagemid particles displaying more than one copy of the fusion protein on the surface of the particle is less 

5 than about 1%. Also, preferably, the amount of phagemid particles displaying more than one copy of the fusion 

protein is less than 10% of the amount of phagemid panicles displaying a single copy of the fusion protein. Most 

preferably, the amount is less than 20%. Typically in this method, the expression vector will further contain a 

secretory signal sequence fused to the DNA encoding each subunit of the polypeptide and the transcription 

regulatory element will be a promoter system. Preferred promoter systems are selected from Jac Z. Xp^ , tac. T7 

10 polymerase, tryptophan, and alkaline phosphatase promoters and combinations thereof. Also, normally the 
method will employ a helper phage selected from M 1 3K07, M 1 3R408, M 1 3-VCS, and Phi X 1 74. The preferred 
helper phage is M13K07, and the preferred coat protein is the M 13 Phage gene 111 coat protein. The preferred 
host is E. coli, and protease-deficient strains of E. coli. 

Further details of the foregoing and similar mutagenesis techniques are found in general textbooks, such 

] 5 as, for example, Sambrook et aL, supra, and Current Protocols in Molecular Biology . Ausubel et al cds., supra. 

Naturally-occurring amino acids are divided into groups based on common side chain properties: 

(1) hydrophobic: norleucine, met, aia, val, leu, ile; 

(2) neutral hydrophobic: cys, ser. thr; 

(3) acidic: asp, glu; 

20 (4) basic: asn, gin, his, lys, arg; 

(5) residues that influence chain orientation: gly, pro; and 

(6) aromatic: trp, tyr, phe. 

Conservative substitutions involve exchanging a member within one group for another member within 
the same group, whereas non-conservative substitutions will entail exchanging a member of one of these classes 
25 for another. 

Amino acid sequence deletions generally range from about 1 to 30 residues, more preferably about 1 
to 10 residues, and typically are contiguous. 

Amino acid insertions include amino- and/or carboxyl-terrninal fusions ranging in length from one 
residue to polypeptides containing a hundred or more residues, as well as intrasequence insertions of single or 
30 multiple amino acid residues. Intrasequence insertions (i.e. insertions within the PTP HSC protein amino acid 
sequence) may range generally from about 1 to 10 residues, more preferably 1 to 5 residues, more preferably 1 
to 3 residues. Examples of terminal insertions include the PTP HSC polypeptides with an N-tcrminal methionyl 
residue, an artifact of its direct expression in bacterial recombinant cell culture, and fusion of a heterologous N- 
terminal signal sequence to the N -terminus of the PTP HSC molecule to facilitate the secretion of the mature PTP 
35 HSC from recombinant host cells. Such signal sequences will generally be obtained from, and thus homologous 
to. the intended host cell species. Suitable sequences include STI1 or Ipp for E. coli . alpha factor for yeast, and 
viral signals such as herpes gD for mammalian cells. 

Other insertional variants of the native PTP HSC molecules include the fusion of the N- or C-icrminus 
of the TRAF molecule to immunogenic polypeptides, e.g. bacterial polypeptides such as beta-laciamase or an 
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enzyme encoded by the E. coli trp locus, or yeast protein, and C-terminal fusions with proteins having a long 

half-life such as immunoglobulin regions (preferably immunoglobulin constant regions), albumin, or ferritin, as 

described in WO 89/02922 published on 6 April 1989. 

Since it is often difficult to predict in advance the characteristics of a variant PTP HSC. it will be 

5 appreciated that some screening will be needed to select the optimum variant. 

F. Covaient Modifications of PTP HSC Polypeptides 

Covalent modifications of PTP HSCs arc included within the scope herein. Such modifications are 
traditionally introduced by reacting targeted amino acid residues of the PTP HSC polypeptides with an organic 
derivatizing agent that is capable of reacting with selected sides or terminal residues, or by harnessing 

10 mechanisms of post-translational modifications that function in selected recombinant host cells. The resultant 
covalent derivatives are useful in programs directed at identifying residues important for biological activity, for 
immunoassays of the PTP HSC, or for the preparation of anti-PTP HSC antibodies for immunoaffmity 
purification of the recombinant. For example, complete inactivation of the biological activity of the protein after 
reaction with ninhydrin would suggest that at least one arginyl or lysyl residue is critical for its activity, 

1 5 whereafter the individual residues which were modified under the conditions selected are identified by isolation 
of a peptide fragment containing the modified amino acid residue. Such modifications are within the ordinary 
skill in the art and are performed without undue experimentation. 

Cysteinyl residues most commonly are reacted with a-haloacetates (and corresponding amines), such 
as chloroacetic acid or chloroacetamide, to give carboxyrnethyl or carboxyamidomethyl derivatives. Cysteinyl 

20 residues also are derivatized by reaction with bromotrifluoroacetone, ct-bromo-P-(5-imidozoyl)propionic acid, 
chloroacetyl phosphate, N-alkylmaleimides, 3-nitro-2-pyridyl disulfide, methyl 2-pyridyl disulfide, p- 
chloromercuribenzoate, 2-chloromercuri-4-nitrophenoi, or chloro-7-nitrobenzo-2-oxa- 1 ,3-diazole. 

Histidyl residues are derivatized by reaction with diethylpyrocarbonate at pH 5.5-7.0 because this agent 
is relatively specific for the histidyl side chain. Para-bromophenacyl bromide also is useful; the reaction is 

25 preferably performed in 0. 1 M sodium cacodylate at pH 6.0. 

Lysinyl and amino terminal residues are reacted with succinic or other carboxylic acid anhydrides. 
Derivatization with these agents has the effect of reversing the charge of the lysinyl residues. Other suitable 
reagents for derivatizing a-amino-containing residues include imidoesters such as methyl picolinimidate: 
pyridoxal phosphate; pyridoxal; chloroborohydride; trinitrobenzenesulfonic acid; O-methylisourea; 2,4- 

30 pentanedione; and transaminase-catalyzed reaction with glyoxylate. 

Arginyl residues are modified by reaction with one or several conventional reagents, among them 
phenylglyoxal, 2,3-butanedione, 1,2-cyclohexanedione, and ninhydrin. Derivatization of arginine residues 
requires that the reaction be performed in alkaline conditions because of the high pK a of the guanidine functional 
group. Furthermore, these reagents may react with the groups of lysine as well as the arginine epsilon-amino 

35 group. 

The specific modification of tyrosyl residues may be made, with particular interest in introducing 
spectral labels into tyrosyl residues by reaction with aromatic diazonium compounds or tetranitromcthane. Most 
commonly, N-acetylimidizole and tetranitromethane are used to form O-acetyl tyrosyl species and 3-nitro 
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derivatives* respectively. Tyrosyl residues are iodinated using *~^l or ^ W to prepare labeled proteins for use 

in radioimmunoassay. 

Carboxyl side groups (aspartyl or glutamyl) are selectively modified by reaction with carbodiimides (R l - 
N=C=N-R') such as I -cyciohexyl-3-(2-morpholinyl-4-ethyl) carbodiimide or 1 -ethyl-3-(4-azonia-4.4- 
5 dimcthylpentyl) carbodiimide. Furthermore, aspartyl and glutamyl residues are converted to asparaginyl and 
glutaminyl residues by reaction with ammonium ions. 

Glutaminyl and asparaginyl residues are frequently deamidated to the corresponding glutamyl and 
aspartyl residues. Alternatively, these residues are deamidated under mildly acidic conditions. Either form of 
these residues falls within the scope of this invention. 
10 Other modifications include hydroxylation of proline and lysine, phosphorylation of hydroxyl groups 

of seryl, threonyl or tyrosyl residues, methylation of the cc-amino groups of lysine, arginine, and histidinc side 
chains (T.E. Creighton, Proteins: Structure and Molecular Properties . W.H. Freeman & Co., San Francisco, pp. 
79-86 [1983]), acetylation of the N-terminal amine, and amidation of any C-terminal carboxyl group. The 
molecules may further be covalently linked to nonproteinaceous polymers, e.g. polyethylene glycol. 
15 polypropylene glycol or polyoxyalkylenes, in the manner set forth in U.S. S.N. 07/275,296 or U.S. patents 
4,640,835; 4,496,689; 4,301 , 144; 4,670,41 7; 4,79 1, 192 or 4, 1 79,337. 

Derivatization with Afunctional agents is useful for preparing intramolecular aggregates of the PTP 
HSCs with polypeptides as well as for cross-linking the PTP HSC polypeptide to a water insoluble support matrix 
or surface for use in assays or affinity purification. In addition, a study of interchain cross-links will provide 
20 direct information on conformational structure. Commonly used cross-linking agents include 1.1- 
bis(diazoacetyl)-2-phenyiethane, glutaraldehyde, N-hydroxysuccinimide esters, homobifunctional imidoesters, 
and Afunctional maleimides. Derivatizing agents such as methyl-3-[(p-azidophenyI)dithio]propioimidate yield 
photoactivatable intermediates which are capable of forming cross-links in the presence of light. Alternatively, 
reactive water insoluble matrices such as cyanogen bromide activated carbohydrates and the systems reactive 
25 substrates described in U.S. Patent Nos. 3,959,642; 3,969,287; 3,691,016; 4,195,128; 4,247,642; 4,229,537; 
4,055,635; and 4,330,440 are employed for protein immobilization and cross-linking. 

Certain post-trans lationa I modifications are the result of the action of recombinant host cells on the 
expressed polypeptide. Glutaminyl and aspariginyl residues are frequently post-translationally deamidated to 
the corresponding glutamyl and aspartyl residues. Alternatively, these residues are deamidated under mildly 
30 acidic conditions. Either form of these residues falls within the scope of this invention. 

Other post-translational modifications include hydroxylation of proline and lysine, phosphorylation of 
hydroxyl groups of seryl, threonyl or tyrosyl residues, methylation of the a-amino groups of lysine, arginine, and 
histidine side chains [T.E. Creighton, Proteins: Structure and Molecular Properties . W.H. Freeman & Co.. San 
Francisco, pp. 79-86 (1983)]. 
35 Other derivatives comprise the novel peptides of this invention covalently bonded to a nonproteinaceous 

polymer. The nonproteinaceous polymer ordinarily is a hydrophilic synthetic polymer, i.e. a polymer not 
otherwise found in nature. However, polymers which exist in nature and arc produced by recombinant or in vitro 
methods are useful, as are polymers which are isolated from nature. Hydrophilic polyvinyl polymers fall within 
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the scope of this invention, e.g. polyvinylalcohol and polyvinylpyrrolidone. Particularly useful arc 

polyvinylalkyiene ethers such a polyethylene glycol, polypropylene glycol. 

The PTP HSC polypeptides may be linked to various nonproteinaceous polymers, such as polyethylene 

glycol, polypropylene glycol or polyoxyalkylenes, in the manner set forth in U.S. Patent Nos. 4.640,835; 

5 4.496,689; 4,301,144; 4,670,417; 4,791,192 or 4,179,337. 

The PTP HSCs may be entrapped in microcapsules prepared, for example, by coacervation techniques 

or by interfacial polymerization, in colloidal drug delivery systems (e.g. liposomes, albumin microspheres. 

microemulsions, nano-particles and nanocapsules), or in macroemulsions. Such techniques arc disclosed in 

Remington's Pharmaceutical Sciences . 1 6th Edition, Osol, A., Ed. (1980). 

10 G. Anti-PTP HSC antibody preparation 

(i) Polyclonal antibodies 

Polyclonal antibodies to a PTP HSC molecule generally are raised in animals by multiple subcutaneous 
(sc) or intraperitoneal (ip) injections of the PTP HSC and an adjuvant. It may be useful to conjugate the PTP 
HSC or a fragment containing the target amino acid sequence to a protein that is immunogenic in the species to 

15 be immunized, e.g. keyhole limpet hemocyanin, serum albumin, bovine thyroglobulin, or soybean trypsin 
inhibitor using a Afunctional or derivatizing agent, for example maleimidobenzoyl sulfosuccinimide ester 
(conjugation through cysteine residues), N-hydroxysuccinimidc (through lysine residues), glytaraldehyde. 
succinic anhydride, SOC!> or R*N=0=NR, where R and R* are different alkyl groups. 

Animals are immunized against the immunogenic conjugates or derivatives by combining 1 mg or 1 ug 

20 of conjugate (for rabbits or mice, respectively) with 3 volumes of Freud's complete adjuvant and injecting the 
solution intradermally at multiple sites. One month later the animals are boosted with 1/5 to 1/10 the original 
amount of conjugate in Freud's complete adjuvant by subcutaneous injection at multiple sites. 7 to 14 days later 
the animals are bled and the serum is assayed for anti-PTP HSC antibody titer. Animals are boosted until the 
titer plateaus. Preferably, the animal boosted with the conjugate of the same PTP HSC. but conjugated to a 

25 different protein and/or through a different cross-linking reagent. Conjugates also can be made in recombinant 
cell culture as protein fusions. Also, aggregating agents such as alum are used to enhance the immune response. 

(ii) Monoclonal antibodies 

Monoclonal antibodies are obtained from a population of substantially homogeneous antibodies, i.e., 

the individual antibodies comprising the population are identical except for possible naturally-occurring 
30 mutations that may be present in minor amounts. Thus, the modifier "monoclonal" indicates the character of the 

antibody as not being a mixture of discrete antibodies. 

For example, the anti-PTP HSC monoclonal antibodies of the invention may be made using the 

hybridoma method first described by Kohler & Milstein, Nature 25£:495 (1975), or may be made by recombinant 

DNA methods [Cabilly, et al % U.S. Pat. No. 4,816,567]. 
35 In the hybridoma method, a mouse or other appropriate host animal, such as hamster is immunized as 

hereinabove described to elicit lymphocytes that produce or are capable of producing antibodies that will 

specifically bind to the protein used for immunization. Alternatively, lymphocytes may be immunized in vitro . 

Lymphocytes then are fused with myeloma cells using a suitable fusing agent, such as polyethylene glycol, to 
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form a hybridoma cell [Goding, Monoclonal Antibodies: Principles and Practice , pp.59- 1 03 (Academic Press. 

1986)]. 

The hybridoma cells thus prepared are seeded and grown in a suitable culture medium that preferably 
contains one or more substances that inhibit the growth or survival of the unfused, parental myeloma cells. For 
5 example, if the parental myeloma cells lack the enzyme hypoxanthine guanine phosphoribosyl transferase 
(HGPRT or HPRT). the culture medium for the hybridomas typically will include hypoxanthine, ammopterin. 
and thymidine (HAT medium), which substances prevent the growth of HGPRT-deficient cells. 

Preferred myeloma cells are those that fuse efficiently, support stable high level expression of antibody 
by the selected antibody-producing cells, and are sensitive to a medium such as HAT medium. Among these, 
10 preferred myeloma cell lines are murine myeloma lines, such as those derived from MOPC-2 1 and MPC-1 1 
mouse tumors available from the Salk Institute Cell Distribution Center, San Diego, California USA, and SP-2 
cells available from the American Type Culture Collection, Rockville, Maryland USA. Human myeloma and 
mouse-human heteromyeloma cell lines also have been described for the production of human monoclonal 
antibodies [Kozbor, J. Immunol. 133 :300 1 ( 1 984); Brodeur, et ai , Monoclonal Antibody Production Techniques 
15 and Applications , pp.5 1-63 (Marcel Dekker, inc.. New York, 1987)]. 

Culture medium in which hybridoma cells are growing is assayed for production of monoclonal 
antibodies directed against PTP HSC. Preferably, the binding specificity of monoclonal antibodies produced 
by hybridoma cells is determined by immunoprecipitation or by an in vitro binding assay, such as 
radioimmunoassay (RIA) or enzyme-linked immunoabsorbent assay (ELISA). The binding affinity of the 
20 monoclonal antibody can, for example, be determined by the Scatchard analysis of Munson & Pollard, Anal. 
Biochem. Ifl7:220(1980). 

After hybridoma cells are identified that produce antibodies of the desired specificity, affinity, and/or 
activity, the clones may be subcloned by limiting dilution procedures and grown by standard methods. Coding, 
Monoclonal Antibodies: Principles and Practice , pp.59- 104 (Academic Press, 1986). Suitable culture media for 
25 this purpose include, for example, Dulbecco's Modified Eagle's Medium or RPMI-1640 medium. In addition, 
the hybridoma cells may be grown in vivo as ascites tumors in an animal. 

The monoclonal antibodies secreted by the subclones are suitably separated from the culture medium, 
ascites fluid, or serum by conventional immunoglobulin purification procedures such as, far example, protein 
A-Sepharose, hydroxylapatite chromatography, gel electrophoresis, dialysis, or affinity chromatography. 
30 DNA encoding the monoclonal antibodies of the invention is readily isolated and sequenced using 

conventional procedures (e.g., by using oligonucleotide probes that arc capable of binding specifically to genes 
encoding the heavy and light chains of murine antibodies). The hybridoma cells of the invention serve as a 
preferred source of such DNA. Once isolated, the DNA may be placed into expression vectors, which are then 
transfected into host cells such as simian COS cells. Chinese hamster ovary (CHO) cells, or myeloma cells that 
35 do not otherwise produce immunoglobulin protein, to obtain the synthesis of monoclonal antibodies in the 
recombinant host cells. The DNA also may be modified, for example, by substituting the coding sequence for 
human heavy and light chain constant domains in place of the homologous murine sequences. Morrison, et at.. 
Proc. Nat. Acad. Sci. 81 , 685 1 ( 1 984), or by covalently joining to the immunoglobulin coding sequence all or 
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part of the coding sequence for a non-immunoglobulin polypeptide. In that manner, "chimeric" or "hybrid" 

antibodies are prepared that have the binding specificity of an anti-TRAF monoclonal antibody herein. 

Typically such non-immunoglobulin polypeptides are substituted for the constant domains of an 
antibody of the invention, or they are substituted for the variable domains of one antigen-combining site of an 
5 antibody of the invention to create a chimeric bivalent antibody comprising one antigen-combining site having 
specificity for a PTP HSC and another antigen-combining site having specificity for a different antigen. 

Chimeric or hybrid antibodies also may be prepared in vitro using known methods in synthetic protein 
chemistry, including those involving crosslinking agents. For example, immunotoxins may be constructed using 
a disulfide exchange reaction or by forming a thioether bond. Examples of suitable reagents for this purpose 
10 include iminothiolate and methyl-4-mercaptobutyrimidate. 

For diagnostic applications, the antibodies of the invention typically will be labeled with a detectable 
moiety. The detectable moiety can be any one which is capable of producing, either directly or indirectly, a 
detectable signal. For example, the detectable moiety may be a radioisotope, such as 3 H. l4 C, 32 P. S, or J ~ 5 I, 
a fluorescent or chcmiluminescent compound, such as fluorescein isothiocyanate, rhodamine, or luciferin; biotin; 
15 radioactive isotopic labels, such as, e.g., ^I. 32 p ^ 14^ Qr 3^ or an en2Vrne sucn as alkaline phosphatase, 
bcta-galactosidase or horseradish peroxidase. 

Any method known in the art for separately conjugating the antibody to the detectable moiety may be 
employed, including those methods described by Hunter, et ai, Nature 144 :945 (1962); David. : ct ul. % 
Biochemistry 13:1014(1974): Pain, et al. J. Immunol. Meth. 40:219 (1981V and Nvgren. J. Histochem. and 
20 Cvtochem. 30:407(1982). 

The antibodies of the present invention may be employed in any known assay method, such as 
competitive binding assays, direct and indirect sandwich assays, and immunoprecipitation assays. Zola, 
Monoclona l Antibodies: A Manual of Techniques , pp. 147-158 (CRC Press, Inc., 1987). 

Competitive binding assays rely on the ability of a labeled standard (which may be a PTP HSC 
25 polypeptide or an immunologically reactive portion thereof) to compete with the test sample analyte (PTP HSC) 
for binding with a limited amount of antibody. The amount of PTP HSC in the test sample is inversely 
proportional to the amount of standard that becomes bound to the antibodies. To facilitate determining the 
amount of standard that becomes bound, the antibodies generally are insolubilized before or after the 
competition, so that the standard and analyte that are bound to the antibodies may conveniently be separated from 
30 the standard and analyte which remain unbound. 

Sandwich assays involve the use of two antibodies, each capable of binding to a different immunogenic 
portion, or epitope, of the protein to be detected. In a sandwich assay, the test sample analyte is bound by a first 
antibody which is immobilized on a solid support, and thereafter a second antibody binds to the analyte. thus 
forming an insoluble three part complex. David & Greene, U.S. Pat No. 4,376, 1 10. The second antibody may 
35 itself be labeled with a detectable moiety (direct sandwich assays) or may be measured using an anti- 
immunoglobulin antibody that is labeled with a detectable moiety (indirect sandwich assay). For example, one 
type of sandwich assay is an EL1SA assay, in which case the detectable moiety is an enzyme. 
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(iii) 



Humanized antibodies 
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20 



25 



30 



Methods for humanizing non-human antibodies are well known in the art. Generally, a humanized 
antibody has one or more amino acid residues introduced into it from a source which is non-human. These non- 
human amino acid residues are often referred to as "import" residues, which are typically taken from an "import" 
variable domain. Humanization can be essentially performed following the method of Winter and co-workers 
[Jones et ai. Nature 321 . 522-525 (1986); Riechmann et ai , Nature 332, 323-327 (1988): Verhoeyen et al. y 
Science 239 , 1534-1536 (1988)], by substituting rodent CDRs or CDR sequences for the corresponding 
sequences of a human antibody. Accordingly, such "humanized" antibodies are chimeric antibodies (Cabilly, 
supra ), wherein substantially less than an intact human variable domain has been substituted by the corresponding 
sequence from a non-human species. In practice, humanized antibodies are typically human antibodies in which 
some CDR residues and possibly some FR residues are substituted by residues from analogous sites in rodent 
antibodies. 

It is important that antibodies be humanized with retention of high affinity for the antigen and other 
favorable biological properties. To achieve this goal, according to a preferred method, humanized antibodies 
are prepared by a process of analysis of the parental sequences and various conceptual humanized products using 
three dimensional models of the parental and humanized sequences. Three dimensional immunoglobulin models 
are commonly available and are familiar to those skilled in the art. Computer programs are available which 
illustrate and display probable three-dimensional conformational structures of selected candidate immunoglobulin 
sequences. Inspection of these displays permits analysis of the likely role of the residues in the functioning of 
the candidate immunoglobulin sequence, i.e. the analysis of residues that influence the ability of the candidate 
immunoglobulin to bind its antigen. In this way, FR residues can be selected and combined from the consensus 
and import sequence so that the desired antibody characteristic, such as increased affinity for the target 
antigen(s), is achieved. In general, the CDR residues are directly and most substantially involved in influencing 
antigen binding. For further details see U.S. application Serial No. 07/934,373 filed 21 August 1992, which is 
a continuation-in-part of application Serial No. 07/715,272 filed 14 June 1991. 

Alternatively, it is now possible to produce transgenic animals (e.g. mice) that are capable, upon 
immunization, of producing a full repertoire of human antibodies in the absence of endogenous immunoglobulin 
production. For example, it has been described that the homozygous deletion of the antibody heavy chain joining 
region (Jj_j) gene in chimeric and germ-line mutant mice results in complete inhibition of endogenous antibody 
production. Transfer of the human germ-line immunoglobulin gene array in such germ-line mutant mice will 
result in the production of human antibodies upon antigen challenge. See, e.g. Jakobovits et ai, Proc. Natl. 
Acad. Sci. USA 2Q, 2551-255 (1 993); Jakobovits et ai , Nature 362 . 255-258 ( 1 993). 



Bispecific antibodies are monoclonal, preferably human or humanized, antibodies that have binding 
specificities for at least two different antigens. In the present case, one of the binding specificities is for a PTP 
HSC, the other one is for any other antigen, for example an antigen expressed on the surface of a leukemia cell, 
if the antibody is an antagonist of a native PTP HSC and is used to induce differentiation of undifferentiated 
lekemia cells. If an agonist antibody specifically binding to a native PTP HSC is used to expand stem cells with 
growth factors, as hereinafter described, the second specificity could be provided by a stem cell growth factor. 



(>v) 



Bispecific antibodies 
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* Such constructs can also be referred to as bispecific immunoadhesins. Methods for making bispecifk antibodies 

(and bispecific immunoadhesins) are known in the an. 

Traditionally, the recombinant production of bispecific antibodies is based on the coexpression of two 

immunoglobulin heavy chain-light chain pairs, where the two heavy chains have different specificities (Millstein 

5 and Cuello, Nature 305 . 537-539 ( 1 983)). Because of the random assortment of immunoglobulin heavy and light 

chains, these hybridomas (quadromas) produce a potential mixture of 10 different antibody molecules, of which 

only one has the correct bispecific structure. The purification of the correct molecule, which is usually done by 

affinity chromatography steps, is rather cumbersome, and the product yields are low. Similar procedures are 

disclosed in PCT application publication No. WO 93/08829 (published 13 May 1993), and in Traunecker et a/., 

10 EMfiO JO, 3655-3659(1991). 

According to a different and more preferred approach, antibody variable domains with the desired 
binding specificities (antibody-antigen combining sites) arc fused to immunoglobulin constant domain sequences. 
The fusion preferably is with an immunoglobulin heavy chain constant domain, comprising at least part of the 
hinge, and second and third constant regions of an immunoglobulin heavy chain (CH2 and CH3). It is preferred 

1 5 to have the first heavy chain constant region (CH 1 ) containing the site necessary for light chain binding, present 
in at least one of the fusions. DNAs encoding the immunoglobulin heavy chain fusions and, if desired, the 
immunoglobulin light chain, are inserted into separate expression vectors, and are cotransfected into a suitable 
host organism. This provides for great flexibility in adjusting the mutual proportions of the three polypeptide 
fragments in embodiments when unequal ratios of the three polypeptide chains used in the construction provide 

20 the optimum yields. It is, however, possible to insert the coding sequences for two or all three polypeptide chains 
in one expression vector when the expression of at least two polypeptide chains in equal ratios results in high 
yields or when the ratios are of no particular significance. In a preferred embodiment of this approach, the 
bispecific antibodies are composed of a hybrid immunoglobulin heavy chain with a first binding specificity in 
one arm, and a hybrid immunoglobulin heavy chain-light chain pair (providing a second binding specificity) in 

25 the other arm. It was found that this asymmetric structure facilitates the separation of the desired bispecific 
compound from unwanted immunoglobulin chain combinations, as the presence of an immunoglobulin light 
chain in only one half of the bispecific molecule provides for a facile way of separation. This approach is 
disclosed in copending application Serial No. 07/931,81 1 filed 17 August 1992. 

For further details of generating bispecific antibodies see, for example, Suresh et al. y Methods in 

30 Enzvmolog v 121. 210(1986). 

(v) Heteroconjugate antibodies 

Heteroconjugate antibodies are also within the scope of the present invention. Heteroconjugate 
antibodies are composed of two covalentiy joined antibodies. Such antibodies have, for example, been proposed 
to target immune system cells to unwanted cells (U.S. Patent No. 4 ? 676,980), and for treatment of HIV infection 
35 (PCT application publication Nos. WO 91/00360 and WO 92/200373; EP 03089). Heteroconjugate antibodies 
may be made using any convenient cross-linking methods. Suitable cross-linking agents are well known in the 
art, and are disclosed in U.S. Patent No. 4,676,980, along with a number of cross-linking techniques. 
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H. Peptide and non-peptide analogs of polypeptide PTP HSCs 

Peptide analogs of the PTP HSC polypeptides of the present invention are modelled based upon the 
three-dimensional structure of the native polypeptides. Peptides may be synthesized by well known techniques 
such as the solid-phase synthetic techniques initially described in Merrifleld, J. Am. Chem, Soc. 15 . 2 149-2 1 54 
5 (1963). Other peptide synthesis techniques are, for examples, described in Bodanszky et a/.. Peptide Synthesis, 
John Wiley & Sons, 2nd Ed., 1976, as well as in other reference books readily available for those skilled in the 
art. A summary of peptide synthesis techniques may be found in Stuart and Young, Solid Phase Peptide 
Synthelia, Pierce Chemical Company, Rockford, 1L (1984). Peptides may also be prepared by recombinant DNA 
technology, using a DNA sequence encoding the desired peptide. 
10 In addition to peptide analogs, the present invention also contemplates non-peptide (e.g. organic) 

compounds which display substantially the same surface as the peptide analogs of the present invention, and 
therefore interact with other molecules in a similar fashion. 

I. Use of the PTP HSCs 

The PTP HSCs of the present invention are useful for a variety of purposes. For example, native PTP 
15 HSCs are useful for the identification and isolation of a PTP HSC analog in another mammalian species. Native 
PTP HSCs and their functional equivalents are also useful in screening assays designed to identify agonist of 
antagonist of native PTP HSCs. Such assays may take the form of any conventional cell-type or biochemical 
binding assay, and can be performed in a variety of assay formats well known for those skilled in the art. As 
example is the so called "two-hybrid" assay format using the Matchmaker Two-Hybrid System (Clontech) 
20 according to the manufacturer's instructions. 

The PTP HSCs of the present invention as well as their agonists can additionally be used for the 
maintenance of stem/progenitor cells in cell culture. Agonists which inhibit differentiation but allow for 
hematopoietic stem cell growth are particularly useful for this purpose, since their use results in an amplification 
of the stem cells without differentiation (self-renewal). This process might be useful, as an example, for the 
25 expansion of hematopoietic stem cells prior to autologous or heterologous bone marrow transplantation. The 
same approach can be used in vivo for the expansion of stem cells with growth factors, in the absence of 
diferentiation. 

It is believed that the native PTP HSCs of the present invention may be expressed in leukemic cells. 
Accordingly, antagonist of the PTP HSCs of the present invention may be used for the induction of 
30 differentiation of undifferentiated leukemia cells. This might allow for aggressive undifferentiated leukemia cells 
to become differentiated, which, in turn, facilitates their treatment. 

PTP HSC antagonists may also be used to induce differentiation of hematopoietic stem cells. As 
inhibition of the native PTP HSC enzyme might induce progenitor cells to differentiate, an antagonist of PTP 
HSC might act as a pan-induccr of myeloid, crythroid and lymphoid production. This use of PTP HSC 
35 antagonists may obviate or decrease the need for the use of stem cell growth factors. 

Further details of the invention are illsutrated in the following non-limiting examples. 
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Example 1 

Identification and cloning of murine PTP HSC 
A. Materials and Methods 
Isolation of embryonic hn*°CD34 ni Sca*" hematopoietic stem cells. Yolk sacs or embryos were dissected from 
5 timed pregnant females at day 10.5. Fetal livers were isolated from day 13.5-14 embryos. Yolk sac and 

o 

embryonic tissues were dissociated with 1% collagenase in RPMl medium at 37 C for 15 minutes. Cells were 
further dissociated by two passages through a 16 gauge needle. Fetal liver was only dissociated by passage 
through a 16 guage needle. Adherent cells were attached to plastic by overnight incubation, after which the non 
adherent hematopoietic cells were incubated with a lineage cocktail of antibodies (1 ng each of TER 1 19, Gr- 1 , 

10 Ly-1, transferrin receptor and B220) for 1 hr on ice. Cells were washed, and the lineage positive cells were 
depleted using magnetic beads and a Miltenyi column. Lineage negative cells were pelleted, resuspended in 2% 
FCS, PBS and incubated with rabbit anti-murinc CD34 antibody (Baumhueter el al. Science 262 , 436-38 
[1993]) on ice for 1 hr. Cells were washed three times in 2% FCS, PBS, resuspended in the same buffer and 
incubated with donkey, anti-rabbit FITC conjugated antibody and. in some cases, PE conjugated anti Sea 

1 5 antibody for I hr on ice. The cells were washed five times with 2% FCS, PBS, and than isolated by cell sorting 
on an ELITE cell sorter. 

PCR analysis of mRNA isolated from lin lo CD34* 1 'Sca n ' hematopoietic stem cells. Messenger RNA 
was isolated from the Lin^°CD34 m Sca m fraction of fetal yolk-sac hematopoietic cells (Micro- FasiTrack. 
InVitrogene). Poly A+ RNA was reverse transcribed with random hexamers (Promega) and Moloney murine 

20 Leukemia virus revere transcriptase (Superscript II, GIBCO BRL). 1/4 of this cDNA was amplified by PCR 
using degenerate mixed oligonucleotides primers. Sense and antisensc primers corresponding to the concensus 
PTP amino acid sequences H / D FWRM 1 /yW (5 , -A C / T TT C / T TGG A / c GlATG A / G T^TGG-3 , ) (SEQ ID. NO: 
14, where the degenerate positions are designated by "N") and WPD F /|jGVP (5'- 
GGIAC G / A T / A G / A G / A TClGGCCA-3') (SEQ. ID. NO: 1 5, wherein the degenerate positions are designated by 

25 "N") respectively were used. PCR were carry out in IX Taq DNA polymerase buffer (GIBCO BRL) plus 0.2 
mM of each dNTP, 10% DMSO and 5 units Taq polymerase (GIBCO BRL) for 25 cycles of 94° C for 1 minute, 

o o 

55 C for 1 min and 72 C for I minute. The PCR products were treated with Klenow enzyme (New England 
Biolabs) at 30° C for 30 minutes, cloned into Smal site of pRK-5 (EP 307,247, published March 15, 1989) 
plasmid, and subsequently sequenced (Sequenase, USB). 

30 cDNA and genomic cloning. Adapter-linked double strain cDNA was prepared from A+ RNA of day- 

10 murine embryos (Marathon-ready cDNA synthesize kit, Clontech) using either random hexamer or oligo dT 
primer. Full-length cDNA was isolated by 5' or 3* rapid amplification of cDNA ends (RACE) of the marathon- 
ready cDNAs. Genomic clones encoding the PTP HSC gene were isolated using standard techniques. The plaque 
purified lambda phage DNA was digested with Not 1, and the insert fragment was directly cloned without 

35 purification into Not 1 digested Bluescript. Exons were mapped using a combination of restriction digestion and 
southern blotting as well as DNA sequencing using custom primers. 

Bacterial expression of the PTP. cDNA sequences encode amino acid 8 to 323 containing the 
phosphatase domain were obtained by PCR using sense oligomer 5- 
CACGGTCGACGGTGAGGAGCrrCTTTGAGCAGCTGGAGG-3' (SEQ. ID. NO: 3). and antisense oligomer 
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5'-GTTGCGGCCGCGATTGGAGCGCAGTTCTCCTTGAGGTTCTGG-3 t (SEQ. ID. NO: 4). The PCR 

fragment was treated with Sail and NotI restriction enzyme and cloned into Sail and NotI digested pGEX-4T- 1 

plasmid (Pharmacia). Fusion protein was affinity purified using a glutathione sepharose column (Pharmacia). 

Tyrosine phosphatase assays on the GST-fusion protein were carried out following the manufacture's procedure 

5 using two different tyrosine phosphorylated peptides from a tyrosine phosphatase assay kit (Boehringer 

Mannheim). 

Quantitative PCR analysis of RNA isolated from hematopoietic cells. cDNA was made from + 
RNA by reverse transcription (RT) with random hexamer. PCR was then used to amplified quantitatively PTP 
HSC cDNA and, as an internal standard, triosephosphate isomerase (TPI) cDNA. For each PCR, 6 ul of the 20 

10 ul RT reaction was brought to 50 ul so as to contain 0,3 mM of dNTPs, 4uCi of 32 P dATP (3,000Ci/mmol, 
Amersham), 100 pmol of each of the four primers, and 5 units of Taq DNA polymerase (GIBCO BRL). 
Seventeen PCR cycles of 94° C for 50 seconds, 55° C for 50 seconds, and 70° C for 70 seconds. One-tenth of 
each PCR samples was elecrrophoresed in a 6% polyacrylamide gel, and the PCR products were quantitate by 
phosphorimaging (Fuji). Conditions for accurate quantitation of either PTP HSC or TPI were assessed in 

1 5 experiments that used serial dilutions of a standard preparation of A ' RNA from 32D cells to determine for each 
primer pair the times of primer annealing and primer extension and the cycles that provided for a linear 
correlation between the amount of template RNA and the PCR product. Under the PCR conditions ultimately 
chosen, certain amount of sample RNA was analyzed simultaneously with serial dilutions of the standard RNA 
and a reverse transcriptase minus control. 

20 Northern blot analysis of tissues and cell lines. A Sall-NotI 1.3 kb PTP HSC cDNA fragment was 

used to probe murine multi-tissue northern blot (Clontech). The same northern blot was used with various other 
probes, ah of which demonstrated detectable, undegraded transcripts. 
PCR primer pairs 

5' RACE primers: antisense primer S'-CCTGGAGGGTCCTGAGAGTGATGTCTGCATTCAGTG^' (SEQ. ID. 
25 NO: 5), 5 f -CCTCTTGGAGCAGGGAAAGGATGACTCTTGTCTC-3' (SEQ. ID. NO: 6), 5- 
CAGCTGCTCCAAGAAGCTCCTCACCAAGTC-3' (SEQ, ID. NO: 7). Sense primer: API and AP2 
(Clontech). 

3'RACE primers: sense primer 5*-GGTAGAGGTGGGCAGGGTG AAGTGTTCTCGC-3* (SEQ. ID. NO: 8), 5'- 
CACTGAATGCAGACATCACTCTCAGGACCCTCCAGG-3* (SEQ. ID. NO: 9), 5 f - 
30 GAGACAAGAGTCATCCTTTCCCTGCTCCAAGAGG-3' (SEQ. ID. NO: 10). Antisense primer: API and 
AP2 (Clontech). 

Quantitative RT-PCR primers: PTP HSC sense primer 5- 

CACTGAATGCAGACATCACTCTCAGGACCCTCCAGG-3' (SEQ. ID. NO: 9), antisense primer 5'- 
GAATGGTAACCTGGAGGGTCCTGAG-3' (SEQ. ID. NO: 11). TPI sense primer 5- 
35 GAGAAGGTCGTGTTCGAG (SEQ. ID. NO: 12), antisense primer 5'-GTGTACTTCCTGTGCCTG-3* (SEQ. 
ID. NO: 13). 

B. cDN A cloning of PTPs from Hematopoietic Stem Cells 

In order to analyze PTPs potentially involved with the maintenance of the hematopoietic stem cell, we 
isolated a highly purified population of these cells from either the murine 10.5 day yolk sac or embryo. 
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Previously, we showed that both progenitor activity as well as stromal cell repopuJating activity were found in 

the CD34 m fraction of these embryonic cells [3] (C. Fennie and L. Lasky-unpublished observations). In addition. 

others have shown that the murine CD34 n ^ population isolated from bone marrow (Krause et ai. Blood 84(3). 

691-701 [1994]), or fetal liver (Ziegler et ai, Blood 84. 2422-2450 [1994]) contains stem cells capable of 

5 reconstituting lethally irradiated animals. In order to isolate a more highly purified fraction of these progenitor 

cells, we included a lineage depletion step as well as a positive selection step with the Sea antibody (Uchida et 

ai, Blood 52(12), 3758-3779 [1994]), in addition to the CD34 antibody. These morphologically, primitive 

hematopoietic cells show a higher degree of stromal cell repopulating ability as well as cobblestone formation 

as compared to the previously described CD34 n * progenitor cells, and we are currently investigating their in vivo 

1 0 repopulating activity (C.Fennie and L. Lasky-unpublished observations). Previous investigators have shown that 

lo hi 

the lin Sca ni fraction of bone marrow hematopoietic cells has a high level of repopulating activity (Sprangrudc 

et ai. Science 241 . 58-62 [1988]). Thus, it is likely that the lin lo CD34 ni Sca hi cells isolated from the early 
embryo contain self renewing hematopoietic stem cells (Uchida et ai, supra: Krause ei ai., supra: Ziegler et ai. r 
supra. $ 
15 Consensus PCR using primers derived from two highly conserved regions of the PTP phosphatase 

domain resulted in the cloning and sequencing of- 70 PCR fragments. As shown in Table 1, a diversity of known- 
receptor and non-receptor PTPs were detected in this fraction of these progenitor cells, and many of these PTPs s *> 
have not previously been described in the hematopoietic stem cell compartment. Two novel PTPs (referred to 
in the table as PTP 38 and PTP 49) were also isolated. One is a receptor PTP which is related to the *- 

20 homotypically interacting u, k and LAR family and is the subject of a patent application filed concurrently ' --^ 
herewith. The second PTP was found to be most homologous to two previously described non-receptor PTPs, 
murine PTP PEP (Matthews et ai, Mol. Cell Biol. 12(5), 2396-2405 [1992]) and murine/human PTP PEST 
(Takekawa et ai, Biochem. Biophvs. Res. Commun. J_£9(2), 1223-1230 [1992]; Yang et ai, J. Biol. Chem. - & 

268(23) 17650 [1993]; and Charest et ai. Biochem J. 308 (2). 425-432 [1995]), both of which contain a region 

25 that is very high in proline, glutamate, serine and threonine (the "PEST' domain). One of these PTPs, PEP, has 
been demonstrated to be localized to the nucleus (Flores et ai, supra) (see below), so it appeared that the novel 
PTP fragment may have been a new member of this potentially nuclear-localized PTP family. 

Initial PCR and northern analyses with the PTP fragment revealed that the transcript encoding this 
enzyme is extremely rare in embryonic and adult tissues. Thus, the full length cDNA was cloned using the RACE 

30 procedure and RNA isolated from day 10 embryos. Because the RACE cloning of the 5 prime region was 
particularly difficult, the final 5 prime sequence was confirmed using the genomic clone encoding this PTP. As 
can be seen in figure 1, this transcript encodes an open reading frame of 453 amino acids specifying a protein 
of molecular weight 50,253 daltons. Homology searches revealed that the region encoding amino acids 25-290 
were highly homologous to a variety of PTPs, with the highest degree of homology with murine PTP PEP 

35 (Matthews et ai t supra) and murine/human PTP PEST (Takekawa et ai. supra; Yang et ai. supra; and Charest 
et ai. supra) (figure 2). Interestingly, PTP PEP has also been found to be expressed in mature hematopoietic 
cells (Matthews et ai, supra, Flores et ai, supra) although human and murine PTP PEST appear to have a more 
generalized expression pattern (Yang et ai. supra: Charest et ai, supra). As has been shown in these two 
previously described PTPs, the novel PTP reported here contains a region 3 prime of the PTP domain which is 
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very rich in proline, serine, and threonine (-29%) (boxed residues in figure I). This region lacks other significant 

homology with PTPs PEP and PEST, and it is also much shorter in the novel PTP described here. Finally, a 

short region of 20 amino acids at the very carboxy terminus of the protein is highly homologous to similar 

carboxy-tcrminal regions in PTPs PEP and PEST (figure 2). This region is rich in basic residues and the 

5 homologous area in PTP PEP has been shown to be involved with the localization of this enzyme to the nucleus 

(Flores et ai, supra). However, this region also contains two negatively charged residues (arrowheads in figure 

2). so it is likely that this novel PTP is a cytoplasmically localized enzyme, as has been demonstrated for PTP 

PEST (Charest et ai, supra). Finally, the novel PTP described here contains a serine residue at position 37 

(shown starred in figure 2) which is conserved in all three members of this family and which has been shown to 

10 be phosphorylated in PTP PEST by protein kinases C and A (Garton and Tonks, EM BO J. ii( 1 6), 3763-7 1 
[1994]). Interestingly, increased phosphorylation at this site is inhibitory to the PTPasc activity of this PTP 
(Banville et ai. Genomics 27( 1 ). 165-173 [1995]). In summary, the novel PTP described here appears to be a 
new member of a family of non-receptor PTPs which contain P, S and T rich regions (figure 3). In addition, all 
three of these PTPs contain a homologous carboxy-terminal region which has been shown to function as a nuclear 

1 5 localization signal for one of the family members (PTP PEP), although the murine PEST enzyme has been found 
to localize to the cytoplasm. 

Previous analyses of the genomic structures of other PTPs suggested that these enzymes were 
constructed from genes containing a large number of introns. This appears to be the case for the novel PTP 
described here as well. As can be seen from figure 4, the hematopoietic progenitor cell PTP gene is subdivided 

20 by 14 introns. Analysis of the intronic structure of this novel PTP as compared with that found for other PTPs 
suggests that the novel progenitor cell enzyme is divided into a comparable number of coding exons (for 
example, Banville et ai, supra), in addition, as described below, there appears to be at least one other smaller 
transcript, as well as a heterogeneous collection of large transcripts, suggesting that alternate splicing may occur 
in this gene. Finally, chromosomal localization studies have demonstrated that the gene encoding the human 

25 form of this PTP is found on chromosome 14 (D. Dowbenko and L. Lasky, unpublished data). 

While the sequence of the N-terminal PTP domain contained many of the conserved amino acids found 
to be critical for substrate recognition and tyrosine dephosphorylation (Jia et ai., supra), it was important to 
demonstrate that this sequence indeed encoded an active PTP domain . To this end we produced a construct using 
the glutathione-S-transferase (GST) fusion system which contained the entire PTP-homologous region derived 

30 from the novel cDNA clone. The protein was isolated from induced cultures of bacteria, and it was tested for 
the dephosphorylation of tyrosine using two different phosphorylated peptides (see materials and methods). As 
can be seen from figure 5, the isolated GST-PTP domain fusion protein had a very high level of PTP activity, 
with significant dephosphorylation at only 20 picograms of enzyme per reaction, which was partially sensitive 
to inhibition by orthovanadate. The only partial inhibition of enzyme activity by orthovanadate was likely due 

35 to the high level of activity as well as insufficient levels of the inhibitor. These data indicate that this 
hematopoietic progenitor cell PTP is an active tyrosine phosphatase. 

C. Expression of the progenitor cell PTP transcript 

The isolation of the novel PTP from the lin*°CD34 m sca ni population of hematopoietic stem cells 
suggested that this PTP might be specific for very early progenitor cells. As figure 6A illustrates, quantitative 
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PCR comparing the levels of the transcript encoding this PTP in the lin lo CD34 m sca ni . a largely undifferentiated 
population containing hematopoietic stem cells (Spangrude et ai, supra; Krausc et ai.. supra: Zeigler et ai, 
Blood 84(8). 2422-2430 [1994]), versus the lin lo CD34 h, sca l ° population, a more differentiated cell population 
(Spangrude et ai, supra), containing committed progenitors, demonstrated that there was an approximately 1 0 
5 fold lower level of the transcript in the more differentiated sca l ° cells, in order to examine if this 
downregulation continued as differentiation progressed, quantitative PCR was performed using RNA isolated 
from suspension cultures of lin lo CD34 ni sca hi cells that were exposed to IL-i, IL-3, EPO and GM-CSF for 
various periods of time in the absence of stromal cells. Analysis of cell numbers, together with Wright-Giemsa 
staining of the cultures, revealed that the undifferentiated lin lo CD34 ni sca ni cell population dramatically 

10 expanded in the presence of these growth and differentiation factors and also metamorphosed along the myeloid 
pathway to ultimately give rise to cultures that contained predominately macrophages after 14 days (data not 
shown). As figure 6B illustrates, the transcript encoding the novel PTP disappears as the cells replicate and 
develop, and it is completely absent after approximately 7 days in culture. These data are consistent with a role 
for this PTP in early stem or progenitor cells, but not in the mature, committed cell populations. 

15 The potential importance of this PTP specifically to the hematopoietic system is illustrated in figure 7 A 

where northern blot analyses of various tissues and cell lines are shown. As can be seen from this figure, the 
transcript appears to be undetectable in the embryonic samples, and it is expressed at exceedingly low levels in 
adult lung and kidney. Thus, while there are clearly hematopoietic stem cells in the embryo, they must be so rare 
as to not allow for the direct detection of the transcript encoding the novel PTP. Particularly interesting is the 

20 lack of a signal in the RNA isolated from the adult spleen, a hematopoietic compartment that contains 
predominately mature, differentiated hematopoietic cells and which was previously shown to express PTP PEP 
(Matthews et ai. t supra). The very faint transcripts detected in the lung have been confirmed by non -quantitative 
PCR analysis (J. Cheng and L. Lasky-unpublished data). However, the transcripts in the lung are very rare and 
may be aberrant, since screening of an adult lung library (1x10^ clones) resulted in only two positive isolates, 

25 both of which contained introns (J. Cheng and L. Lasky-unpublished observations). 

The lack of detectable signal in most tissues of the adult and embryo, coupled with the identification 
of the transcript in the highly purified stem cell population, but not in the differentiated hematopoietic cells, 
suggested that this PTP might be expressed in hematopoietic progenitor cell lines. As figure 7B illustrates, the 
transcripts encoding this novel PTP are easily detectable in the three different murine hematopoietic progenitor 

30 cell lines tested by both northern and PCR analyses. In all three cases, these lines represent relatively 
undifferentiated precursors of mature hematopoietic cells, although they are certainly not self-renewing stem 
ceils. The cells appear to encode two major transcripts, in addition to a diversity of minor transcripts. One major 
transcript is an -1 .8 kB RNA that corresponds to the cDNA clone described above, while the other encodes a 
-0.7 kB RNA that remains to be characterized. However, it is likely that this smaller transcript is due to 

35 alternative splicing, since, as described above, the gene encoding this PTP is divided into a large number of 
exons (Figure 4). Figure 7C illustrates that the PTP HSC transcript is undetectable by PCR in a differentiated 
T cell clone, a result which is again consistent with the downregulation of this PTP in differentiated cells. Finally, 
PCR analysis of various human cell lines using the murine primer pair revealed expression of a similarly sized 
fragment in human CMK progenitor cells, and the sequence of this PCR fragment revealed that the human 
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homologuc is highly conserved with the murine PTP (J. Cheng, Kai Wu and L. Lasky-unpublished results). In 

summary, the novel PTP described here appears to be expressed predominately in very early hematopoietic 

progenitor cells, consistent with a potential role in the regulation of the differentiation state of these cells 

D. Discussion 

5 The ability of the hematopoietic stem cell to self renew in the absence of differentiation is an important 

factor which allows for this cell to provide a large number of progeny throughout the lifetime of the organism. 
The maintenance of the undifferentiated state must occur at the same time as the stem cell replicates, since this 
cell type must be continually replenished. Thus, there must be specific mechanisms that decrease some aspects 
of cellular activation, such as differentiation, while not affecting others, such as division. Because tyrosine 

10 phosphorylation is a critical aspect of cellular activation, based upon the results disclosed herein, it is likely that 
distinctive mechanisms which regulate tyrosine phosphorylation are involved with the maintenance of the self 
renewing stem cell. Such specificity can be accomplished in part by the expression of the appropriate growth 
factors by the hematopoietic cell stroma. However, another means by which such regulation can occur is by the 
dephosphorylation of a subset of tyrosine phosphorylated proteins. One mechanism that would allow for specific 

1 5 dephosphorylation is via PTPs which recognize only a fraction of the tyrosine phosphorylated proteins in the cell. 
Thus, the analysis of PTPs expressed by hematopoietic stem cells might further our understanding of the 
mechanisms by which stem cell self renewal is attained. The non-receptor PTP described in the present 
application has some of the features that might be expected for a regulator of stem cell differentiation. 

Several aspects of this novel PTP, which is referred to throughout the specification and claims as the 

20 PTP of hematopoietic stem cells or PTP HSC, are consistent with a role in the regulation of aspects of early 
hematopoietic progenitor cell biology. First, the specific expression of the transcript in very early hematopoietic 
progenitor ceils, together with the down-regulation of the message as the cells differentiate, is compatible with 
a role for this enzyme in physiological aspects of the less differentiated stem cell. While little is understood 
regarding the regulation of genes in very early hematopoietic progenitor cells, the apparently unique expression 

25 of this gene predominately in these comparatively undifferentiated cells suggests that novel mechanisms of 
transcriptional regulation might be utilized in the control of this locus (Orkin, Curr. Opin. Cell Biol. 7(6). 870- 
877 [1995]). In addition, the predominate lack of expression of this PTP in most adult tissues, with the exception 
of extremely low levels in the lung and the kidney, is also consis ent with a role for this enzyme specifically 
within the hematopoietic progenitor cell compartment. This is in stark contrast to the expression of PTP PEP, 

30 which is found in the lymphoid compartment (Takekawa et al, supra), and PTP PEST, which is apparently 
ubiquitously expressed in a number of cell lines and tissues (Yang et al., supra). Second, the PTP domain can 
be thought of as a moderator of cell activation by virtue of its ability to dephosphorylate tyrosine residues. 
Tyrosine phosphorylation can either up- or down-regulate the activities of various proteins (Fantl et al.. supra). 
so that the PTP HSC might activate or inhibit a specific subset of tyrosine phosphorylated proteins. In a cell that 

35 requires a down-regulation of differentiation, this type of specific modulation would allow for the control of the 
phosphotyrosine levels of proteins activated by various growth factors produced by the hematopoietic stroma 
Together, these data are compatible with a function for this enzyme in the modulation of development of the stem 
cell that is induced by the various growth factors produced by the hematopoietic microenvironment. 
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The hypothesis that PTPs such as PTP HSC arc involved with the maintenance of an undifferentiated 

state in the hematopoietic stem cell suggests possibilities regarding the substrates recognized by this type of PTP. 
Several of the substrates for the PTPs have been previously characterized. For example, the alpha PTP. a 
receptor PTP, has been found to regulate the levels of sre tyrosine phosphorylation which results in 
5 differentiation of neuronal progenitor cells. Lar, as well as CD45, are apparently involved with the regulation 
of the tyrosine phosphorylation levels of the insulin receptor (Kulas et ai, J. Biol. Chem. . 271 (2). 748-754 
(1996); Kulas etai A. Biol. Chem. 271 (2). 755-760 [ 1996]). From the standpoint of hematopoicsis. the SH 2 
domain containing PTP 1C phosphatase has been shown to be critically involved with the regulation of myeloid 
development in the motheaten mouse as well as with the activation state of the EPO receptor (Schulz et ul. 

10 supra: McCuiloch (Klingmuller et ai., supra). Finally, another SM2-containing PTP, PTP 1 D has been found 
to positively regulate the activity of the prolactin receptor (Ali et al. f EMBO J. JJ>( 1 ), 1 35- 1 42 [ 1 996]). These 
examples, among others, are consistent with a role for cytoplasmically- localized PTP domains in the regulation 
of a variety of cellular processes. However, the nature of the substrates recognized by the rarer nuclear PTP 
family is unknown. The dual specificity (i.e. tyrosine and serine/threonine dephosphorylation) phosphatase 

1 5 encoded by the cdc25 locus is a nuclear enzyme that is critical for the regulation of mitosis (Gautier et ai. Cell 

67(1). 197-21 1 [1991 ]). In addition, PAC-1. another nuclear localized PTP, appears to be involved with the ^ 
regulation of the mitogen activated protein kinases. A recently described dual specificity phosphatasc;iTYP 1 . 
related to the vaccinia virus VH 1 phosphatase, appears to be involved with the regulation of both the ERK and 
JNK family of mitogen activated protein kinases (King et ai, Oncogene J_L 2553-2563 [1995]). These data 

20 suggest that several currently described phosphatases appear to play roles in the regulation of tyrosine 
phosphorylated nuclear proteins. 

Another possible substrate for both the nuclear and cytoplasmic PTP enzymes are the STAT proteins. 
These transcriptional activators encompass a family of at least 6 different members, all of which are activated 
by the JAK tyrosine kinases (Darnell et ai., Science 264 (5164). 141501421 [1994]; Ihle et ai., Annu. Rev. 

25 Immunol. 13, 369-398 [1995]). JAK phosphorylation is stimulated by the formation of receptor complexes that 
are stimulated by the binding of various hematopoietic and other growth factor-like molecules (Darnell et ai. 
supra). The phosphorylated STAT proteins than dimerize, migrate to the nucleus and bind specifically to various 
DNA elements that regulate the transcription of growth and differentiation genes (Shuai et at.. Science 
2£i(5129). 1744-1746 [1993]; Heim et ai., Science 267(5202), 1347-49 [1995]). Thus, because these 

30 transcription factors are linked with the activation of hematopoietic differentiation factors, they provide appealing 
targets for negative regulation in hematopoietic stem cells. The absolute requirement for tyrosine 
phosphorylation of these transcriptional activators thus suggests that the novel PTP reported here could regulate 
STAT activation via dephosphorylation of tyrosine residues. In this manner, the upregulation of genes specific 
to the differentiated state could be inhibited by the dephosphorylation of one or more activated STAT molecules. 

35 This hypothesis is especially appealing in the case of the hematopoietic stem cells. In this case, the activation 
of the STAT proteins by the binding of various hematopoietic growth and differentiation factors, a state which 
would induce terminal differentiation, could be downregulated by a stem cell specific PTP such as PTP HSC. 
If this hypothesis is correct the manner by which specific STAT dephosphorylation occurs must be investigated. 
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However, it is possible that the proline, serine, threonine rich domain ofPTP HSC might function to bind to only 

a subset of STATs. 

Finally, recent data have shown that PI P PEST can associate with the p52 snc and p66 snc SH2- 
containing adaptorproteins in a protein kinase C dependent fashion (Habib el al., J. Biol. Chem. 269 (4 1 ). 25243- 
5 25246 [1994]). This association was through an interaction between the N-terminal region of SHC and the 
carboxy-terminal P,S,T rich region of the PTP PEST. The fact that this association was enhanced by protein 
kinase C suggested that serine or threonine phosphorylation might be involved, and a serine in the P.S.T rich 
region of PTP PEST is known to be phosphorylatcd by protein kinase C (Garton and Tonka, supra). 
Interestingly, carbachoh an activator of G protein coupled signaling, was also able to stimulate this association, 
1 0 suggesting that PTP PEST may be involved with the cross talk between G coupled and tyrosine kinase pathways. 
Because of the similarity of PTP HSC to PTP PEST, we suggest that the novel hematopoietic cell PTP of the 
present invention may also interact with SHC, and we are currently examining this possibility using the yeast two 
hybrid system. 

In summary, the data disclosed in this example suggest that hematopoietic stem/progcniior cells 
1 5 specifically express a PTP which appears to be downregulated as the cells differentiate. The PTP seems to be 
predominately specific to hematopoietic progenitor cells, suggesting an important role in the development of this 
cell compartment. However, while these data are potentially important, a number of studies remain to be 
accomplished. Thus, the possibility that the STATs arc substrates for this enzyme, the possible interaction of 
the enzyme with SHC, the constitutive expression of the enzyme in transfected cells and in transgenic animals, 
20 and the effects of null mutations at this locus in vivo may provide for further insights into the mechanisms by 
which stem cell self renewal is regulated. 
Example 2 

Cloning of a human PTP HSC 

Two oligonucleotides (sense: 5'ACTTGGTGAGGAGCTTCTTGGAGCAGCTGGAGG3' (SEQ. ID. 
25 NO: 20), and antisense: 5*GGAATGTAACCTGGAGGGTCCTGA3* (SEQ. ID. NO: 21)) were used as PCR 
primers with reverse transcribed RNA isolated from human CMK hematopoietic progenitor cells. The conditions 
for PCR were identical to those described in Example I for the isolation of the PCR fragment encoding murine 
PTP HSC. The PCR fragment was subcloned into pBS (Bluescript) plasmid, and the DNA sequence was 
determined as described for the murine sequence in Example 1 . The partial nucleotide sequence ana deduced 
30 amino acid sequence of the human PTP HSC are shown in Figure 8. 
Example 3 

Expression of the murine and human PTP HSC 

The native murine PTP HSC polypeptides are expressed in mammalian cells using standard techniques 
Briefly, a DNA fragment encoding the entire PTP HSC is ligated into an expression vector (e.g. PRK5). The 
35 expression vector is then transfected into mammalian cells (e.g. embryonic kidney 292 cells), and the protein 
expression is determined using a monoclonal or polyclonal antibody directed against the native PTP HSC to be 
expressed. 

All documents cited throughout this application as well as the documents cited therein are hereb\ 
expressly incorporated by reference. 
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Table 1 

PTPs expressed in lin^° CD34* 1 * hematopoietic progenitor cells 



Name (GenBank) 


Frequency (%) 


Type 


MMPRTYPHA 


-27 


receptor, single catalytic domain 


MUSC57B16A 


-17 


cytoplasmic, band 4.1 homology 


MUSHCPA 


-14 


cytoplasmic SH2 domains, hematopoietic cells 


MMPTPNU3 


-II 


receptor 


MMMPTPPES 


-4 


cytoplasmic, pst DOMAIN 


MUSCPTP 


-4 


cytoplasmic 


MUSPTPA 


-4 


receptor, kappa, homophilic interacting 


MMTPBLR 


-3 


receptor, epithelial cells, membrane binding 


RNU28356. 


-3 


cytoplasmic 


RATOSTP 




receptor, FNIII domains 


MUSPTPRL 10 




cytoplasmic, band 4. 1 homology 


M60103 




receptor, CD45 


PTP-38 (novel) 




cytoplasmic. PST family related 


PTP-49 (novel) 




receptor related mu/kappa family 
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SEQUENCE LISTING 

(1) GENERAL INFORMATION: 

(i) APPLICANT: Genentech, Inc. 
(ii) TITLE OF INVENTION: Protein Tyrosine Phosphatases 
5 (iii) NUMBER OF SEQUENCES : 2 3 

(iv> CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Genentech, Inc. 

(B) STREET: 460 Point San Bruno Elvci 

(C) CITY: South San Francisco 
10 ( D ) STATE: California 

(E) COUNTRY: USA 

(F) ZIP: 94080 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: 3.5 inch, 1.44 Mb fioppy disk 
15 (B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS /MS- DOS 
{ D ) SOFTWARE: WinPatin {Genentech) 

(vi) CURRENT APPLICATION DATA: 
(A) APPLICATION NUMBER: 
20 (B) FILING DATE: 

(C) CLASSIFICATION : 

(viii) ATTORNEY /AGENT INFORMATION: 
(A) NAME: Dreger, Ginger R. 
<B) REGISTRATION NUMBER: 33,055 
25 (C) REFERENCE / DOCKET NUMBER: P1010PCT 

{ ix) TELECOMMUNICATION INFORMATION : 

(A) TELEPHONE: 415/225-3216 

(B) TELEFAX: 415/952-9881 

(C) TELEX: 910/371-7168 

30 (2) INFORMATION FOR SEQ ID NO : 1 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1529 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS : Single 
35 (D) TOPOLOGY: Linear 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 : 

CTCAGAGCGG GTCGCAGCAT GAGTCGCCAT ACGGACTTGG TGAGGAGCTT 50 

CTTGGAGCAG CTGGAGGCCC GGGACTACCG GGAGGGGGCA ATCTTCGTTC 100 

GTGAGTTCAG CGACATTAAG GCCCGCTCAG TGGCCTGGAA GTCTGAAGGT 150 

40 GTGTGTTCCA CTAAAGCCGG CAGTCGGCTT GGGAACACGA ACAAGAACCG 200 

CTACAAAGAT GTGGTAGCAT ATGATGAGAC AAGAGTCATC CTTTCCCTGC 2 50 

TCCAAGAGGA GGGACATGGA AATTACATCA ATGCCAACTT CATCCGGGGC 300 

ATAGATGGAA GCCAGGCCTA CATTGCGACG CAAGGACCCC TGCCTCACAC 3 50 

ACTGTTGGAC TTCTGGCGCC TGGTTTGGGA GTTTGGGGTC AAGGTAATCC 4 00 

45 TGATGGCCTG TCAAGAGACA GAAAATGGAC GGAGGAAGTG TGAACGCTAT 4 50 

TGGGCCCGGG AGCAGGAGCC TCTAAAGGCT GGCCCTTTCT CCATCACCCT 5C0 
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GACAAAGGAG ACAACACTGA ATGCAGACAT CACTCTCAGG ACCCTCCAGG 5 50 

TTACATTCCA GAAGGAAT.TC CGCTCTGTGC ACCAACTACA GTATATGTCC 60 0 

TGGCCAGACC ACGGGGTTCC CAGCAGTTCT GATCACATTC TCACCATGGT 650 

GGAGGAGGCC CGCTGCCTCC AAGGGCTTGG ACCTGGACCC CTCTGTGTCC 7 00 

5 ACTGCAGTGC TGGCTGCGGA CGAACAGGTG TCCTGTGCGC TGTTGACTAT 7 50 

GTGAGGCAGT TGCTGCTGAC CCAGACAATC CCTCCCAACT TCAGTCTCTT 8 00 

CCAAGTGGTC CTGGAGATGC GGAAACAGCG GCCTGCAGCA GTGCAGACAG 8 50 

AGGAGCAGTA CAGGTTCCTG TACCACACAG ' TGGCTCAGCT ATTCTCCCGC 9 00 

ACTCTCCAGG ACACCAGCCC CCAATACCAG AACCTCAAGG AGAACTGCGC 9 50 

10 TCCAATCTGC AAGGAAGCTT TCTCCCTCAG GACCTCCTCA GCCCTGCCTG 1000 

CCACATCCCG GCCACCAGGA GGGGTTCTCA GGAGCATCTC GGTGCCTGCG 10 50 

CCCCCGACCC TCCCCATGGC TGACACTTAC GCTGTGGTGC AGAAGCGTGG 1100 

CGCTTCGGCG GGCACAGGGC CGGGGCCGCG GGCGCCCACC AGCACGGACA 1150 

CCCCGATTTA CAGCCAGGTG GCTCCACGTG CCCAGCGACC GGTGGCACAC 1200 

15 ACGGAGGACG CACAGGGGAC AACGGCACTG CGCCGAGTTC CTGCGGACCA 12 50 

AAACTCTTCC GGGCCTGATG CCTACGAAGA AGTAACAGAT GGAGCACAGA 1300 

CTGGAGGGCT AGGCTTCAAC TTGCGCATCG GAAGGCCCAA AGGGCCCCGG 13 50 

GATCCTCCAG CAGAGTGGAC ACGGGTGTAA CGAGTGCTGT GCCAGTTATA 14 00 

GCCTGCCACT CGGTGGTGGC TGGACTCCTG GAACCACCAT ACTGCTGTGC 14 50 

20 AGTGTGTTAT GTATGAGTGG GACTTGTGGG CCTGATTCAA AAT AAAAGTT 1500 

TCTCAGGGCG GAAAAAAAAA AAAAAAAAA 1529 

(2) INFORMATION FOR SEQ ID NO : 2 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 453 amino acids 
25 (B) TYPE: Amino Acid 

(D) TOPOLOGY: Linear 

(xi> SEQUENCE DESCRIPTION: SEQ ID NO : 2 : 

Met Ser Arg His Thr Asp Leu Vai Arg Ser Phe Leu GIu Gin Leu 
15 10 15 

30 Glu Ala Arg Asp Tyr Arg Glu Gly Ala lie Phe Val Arg G.lu Phe 

20 25 * 30 

Ser Asp lie Lys Ala Arg Ser Val Ala Trp Lys Ser Glu Gly Val 
35 40 45 

Cys Ser Thr Lys Ala Gly Ser Arg Leu Gly Asn Thr Asn Lys Asn 
35 50 55 60 

Arg Tyr Lys Asp Val Val Ala Tyr Asp Glu Thr Arg Val He Leu 

65 7 0 7 5 
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Ser Leu Leu GJ n Glu Glu Gly His Gly Asn Tyr lie Asn Ala As:: 

8 0 8 5 90 

Phe lie Arq Glv lie Asp Gly Ser Gin Ala Tvr He Ala Thr Gin 

95 IOC 1C5 

5 Gly Pro Leu Pro His Thr Leu Leu Asp Phe Trp Arg Leu Vai Trp 

110 lib 120 

Glu Phe Gly Val Lys Val He Leu Mot Ala Cys Gin Glu Thr Glu 

125 130 135 

Asn Gly Arg Arg Lys Cys Glu Arq Tyr Trp Ala Arg Glu Gin Glu 

10 140 145 150 

Pro Leu Lys Ala GJ y Pro Phe Cys He Thr Leu Thr Lys Glu Thr 

155 160 165 

Thr Leu Asn Ala Asp He Thr Leu Arg Thr Leu Gin Val Thr Pho 

170 1*75 180 

15 Gin Lys Glu Phe Arg Ser Val His Gin Leu Gin Tyr Met Ser Trp 

185 190 195 

Pro Asp His Gly Val Pro Ser Ser Ser Asp His He Leu Thr Met 

200 205 210 

Val Glu Glu Ala Arg Cys Leu Gin Gly Leu Gly Pro Glv Pro Leu 

20 215 220 "" 225 

Cys Val His Cys Ser Ala Gly Cys Gly Arg Thr Gly Val Leu Cys 

230 235 240 

Ala Vai Asp Tyr Val Arg Gin Leu Leu Leu Thr Gin Thr He Pro 

245 250 255 

25 Pro Asn Phe Ser Leu Phe Gin Val Val Leu Gi u Met Arg Lys Gin 

260 265 270 

Arg Pro Ala Ala Val Gin Thr Glu Glu Gin Tyr Arg Phe Leu Tyr 

275 280 285 

His Thr Val Ala Gin Leu Phe Ser Arg Thr Leu Gin Asp Thr Ser 

30 290 295 300 

Pro Gin Tyr Gin Asn Leu Lys Glu Asn Cys Ala Pro He Cys Lys 

305 310 315 

Glu Ala Phe Ser Leu Arg Thr Ser Ser Ala Leu Pro Ala Thr Se - 

320 325 33u 

35 Arg Pro Pro Gly Gly Val Leu Arg Ser lie Ser Val Pro Ala Pro 

335 340 345 

Pro Thr Leu Pro Met Ala Asp Thr Tyr Ala Val Val Gin Lys Arg 

350 ' 355 360 

Gly Ala Ser Ala Gly Thr Gly Pro Gly Pro Arq Ala Pro Thr Ser 

40 365 370 375 

Thr Asp Thr Pro He Tyr Ser Gin Val Ala Pro Arq Ala Gin Arq 

380 335 390 

Pro Val Ala His Thr Glu Asp Ala Gin Gly Thr Thr Ala Leu Arq 

395 400 405 
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Arc Val Pro Ala Asp Gin Asn Ser 3er G.ly Pro Asp A J. a Tyr Glu 

4 10 .4 1b '4 20 

Glu Val Thr Asp Gly Ala Gin Thr Gly Giy Leu Gly Phe Asn Leu 

4 25 4 30 ' 4 3 5 

5 Arg lie Gly Arg Pro Lys Giy Pro Arq Asp Pro Pro Ala Glu Trp 

4 40 44 5 4 50 



Thr Arg Val 

453 



(2) INFORMATION FOR SEQ ID NO : 3 : 



10 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 39 base pairs 
. (B) TYPE: Nucleic Acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 



15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

CACGGTCGAC GGTGAGGAGC TTCTTTGAGC AGCTGGAGG 3 9 



INFORMATION FOR SEQ ID NO : 4 : 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 42 base pairs 
20 (B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 
<D) TOPOLOGY: Linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 



GTTGCGGCCG CGATTGGAGC GCAGTTCTCC TTGAGGTTCT GG 4 2 



25 {?.) INFORMATION FOR SEQ ID NO : 5 : 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 36 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 
30 (D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 5 : 



CCTGGAGGGT CCTGAGAGTG ATGTCTGCAT TCAGTG 3 6 
{2) INFORMATION FOR SEQ ID NO: 6: 



(i) SEQUENCE CHARACTERISTICS: 
35 . (A) LENGTH: 34 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 
40 CCTCTTGGAG CAGGGAAAGG ATGACTCTTG TCTC 3 4 



(2) INFORMATION FOR SEQ ID NO : 1 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: Nucleic Acid 
45 fC) STRANDEDNESS: Single 
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(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ JD NO:7 : 
CAGCTGCTCC AAGAAGCTCC TCACCAAGTC 30 
(2) INFORMATION FOR SEQ ID NO : 8 : 

5 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

10 (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 8 : 

GGTAGAGGTG GGCAGGGTGA AGTGTTCTCG C 31 

(2) INFORMATION FOR SEQ ID NO : 9 : 

(l) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 36 base pairs 
15 (B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 
(Pi TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ TD NO : 9 : 

CACTGAATGC AGACATCACT CTCAGGACCC TCCAGG 36 

20 (2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 
25 (D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

GAGACAAGAG TCATCCTTTC CCTGCTCCAA GAGG 34 

(2) INFORMATION FOR SEQ ID NO : 1 1 : 

(i) SEQUENCE CHARACTERISTICS: 
30 (A) LENGTH: 25 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 
35 GAATGGTAAC CTGGAGGGTC CTGAG 2 5 

(2) INFORMATION FOR SEQ ID NO: 12: 

(l) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: Nucleic Acid 
40 <C) STRANDEDNESS: Single 

{ D ) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 
GAGAAGGTCG TGTTCGAG 18 
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(2) INFORMATION FOR SEQ ID NO : 1 3 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: Nucleic Acid 

5 (C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 3 : 

GTGTACTTCC TGTGCCTG IB 

(2) INFORMATION FOR SEQ ID NO : 1 4 : 

10 (i.) SEQUENCE CHARACTERISTICS: 

{ A ) LENGTH: 18 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 4 : 

ANTTNTGGNG ATGNTTGG 18 

(2) INFORMATION FOR SEQ ID NO : 1 5 : 

(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: IS base pairs 
20 (B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

GGACNNNNTC GGCCA 15 
25 (2) INFORMATION FOR SEQ ID NO: 16: 

(l) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 466 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: SingJe 
30 (D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 6 : 



35 



40 



GCGCGGGGCG 


GCCGGGAGGG 


GGCAGTCCTC 


GCCGGCGAGT 


TCAGCGACAT 


50 


CCAGGCCTGC 


TCGGCCGCCT 


GGAAGGCTGA 


CGGCGTGTGC 


TCCACCCTGG 


100 


CCGGCAGTCG 


GCCAGAGAAC 


GTGAGGAAGA 


ACCGCTACAA 


AGACGTGCTG 


150 


CCTTATGATC 


AGACGCGAGT 


AATCCTCTCC 


CTGCTCCAGG 


AAGAGGGACA 


200 


CAGCGACTAC 


ATTAATGGCA 


ACTTCATCCG 


GGGCGTGGAT 


GGAAGCCTGG 


250 


CCTACATTGC 


CACGCAAGGA 


CCCTTGCCTC 


ACACCCTGCT 


AGACTTCTGG 


300 


AGACTGGTCT 


GGGAGTTTGG 


GGTCAAGGTG 


ATCCTGATGG 


CCTGTCGAGA 


350 


GATAGAGAAT 


GGGCGGAAAA 


GGTGTGAGCG 


GTACTGGGCC 


CAGGAGCAGG 


400 


AGCCACTGCA 


GACTGGGCTT 


TTCTGCATCA 


CTCTGATAAA 


GGAGAAGTGG 


450 
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(2) INFORMATION FOR SEQ ID NO: 17: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15b ar.ino acids 
5 ( B } TYPE: Amino Acid 

( 0) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

Ala Arg Gly Gly Arg Glu Gly Ala Val Leu Ala Gly Glu fhe Ser 

1 5 10 15 

10 As D lie Gin Ala Cvs Ser Ala Ala Trp Lys Ala Asp GJ y Val Cys 

20 25 30 

Ser Thr Val Ala Gly Ser Arg Pro Glu Asn Vai Arq Lys Asn Arg 

3 5 4 0 4 5 

Tyr Lys Asp Val Leu Pro Tyr Asp Gin Thr Arg Val lie Leu Ser 

15 50 " 55 60 

Leu Leu Gin Glu Glu Gly His Ser Asp Tyr 1 .1 e Asn Gly Asn Pho 

65 70 7 5 

lie Arg G.l y Vai Asp Gly Ser Leu Ala Tyr He Ala Thr Gin Gly 

8 0 R5 90 

20 Pro Leu Pro His Thr Leu Leu Asp Phe Trp Arg Leu Val Trp Glu 

95 100 10b 

Phc Gly Val Lys Val He Leu Met Ala Cys Arg Glu He Glu Asn 

110 115 120 

Gly Arg Lys Arg Cys Glu Arg Tyr Trp Ala Gin Glu Gin Glu Pro 

25 125 130 135 

Leu Gin Thr Gly Leu Phe Cys He Thr Leu lie Lys Glu Lys Trp 

14 0 14 5 150 

Leu Asn Glu Asp He 
155 

30 {?.) INFORMATION FOR SEQ ID NO : 1 8 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 278 amino acids 

(B) TYPE: Amino Acid 
(D) TOPOLOGY: Linear 

35 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 

Phe Ala Ser Glu Phe Leu Lys Leu Lys Arg Gin Ser Thr Lys Tyr 

1 5 10 15 

Lys Ala Asp Lys He Tyr Pro Thr Thr Val Ala Gin Arq Pro Lys 

2 0 2 5 30 

40 Asn Tie Lys Lys Asn Arg Tyr Lys Asp He Leu Pro Tyr Asp His 

3 5 4 0 4 5 

Ser Leu Val Glu Leu Ser Leu Leu Thr Ser Asp Glu Asp Ser Ser 

50 55 60 

Tyr He Asn Ala Ser Phe He Lys Gly Val Tyr Gly Pro Lys Ala 

45 6 5 7 0 lb 
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Tyr lie Ala Thr Gin Giy Pro Leu Ser Thr Thr Leu Leu Asp Phe 

8 0 .85 90 

Trp Arg Met lie Trp Glu Tyr Arg lie Leu Va 1 lie Va.l Met Ala 

9 5 10 0 10 5 

5 Cys Met Glu Phe Glu Met Gly Lys Lys Lys Cys Giu Arq Tyr Trp 

110 115 J " 120 

Aia Glu Pro Gly Glu Thr Gin Leu Gin Phe Gly Pro Phe Ser lie 

125 130 135 

Ser Cys Glu Ala Glu Lys Lys Lys Ser Asp Tyr Lys lie Arq Thr 
10 140 14 5 ISC- 
Leu Lys Ala Lys Phe Asn Asn Glu Thr Arg lie lie Tyr Gin Phe 
155 160 165 

His Tyr Lys Asn Trp Pro Asp His Asp Val Pro Ser Ser lie Asp 

170 ' 175 180 

15 Pro He Leu Gin Leu He Trp Asp Met Arg Cys Tyr Gin Giu Asp 

185 190 195 

Asp Cys Val Pro lie Cys lie His Cys Ser Aia Giv Cys Giy Ara 

200 205 " 210 

Thr Giy Val He Cys Ala Val Asp Tyr Thr Trp Met Leu Leu Lys 

20 215 220 "225 

Asp Gly He He Pro Lys Asn Phe Ser Val Phe Asn Leu He Gin 

230 235 240 

Glu Met Arg Thr Gin Arg Pro Scr Leu Val Gin Thr Gin Glu Gin 

245 250 255 

25 Tyr Giu Leu Val Tyr Ser Ala Val Leu Glu Leu Phe Lys Arg His 

260 265 " 270 

Met Asp Val lie Ser Asp Asn His 

275 278 

(2) INFORMATION FOR SEQ ID NO: 19: 

30 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 272 amino acids 

(B) TYPE: Amino Acid 
{ D ) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 9 : 

35 Phe Ala Arg Asp Phe Met Arg Leu Arg Arg Leu Ser Thr Lys Tyr 

15 10 15 

Arg Thr Glu Lys He Tyr Pro Thr Ala Thr Gly Glu Lys Glu Glu 

20 25 ^ 30 

Asn Val Lys Lys Asn Arg Tyr Lys Asp He Leu Pro Phe Asp His 

40 35 40 45 

Ser Arg Val Lys Leu Thr Leu Lys Thr Pro Ser Gin Asp Ser Asd 

50 55 60 

Tyr He Asn Ala Asn Phe He Lys Gly Val Tyr GJy Pro Lys Aia 

65 70 ' 75 
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Tyr Val Ala Thr Gin Gly Pro Leu Ala Asn Thr Val He Asp Pne 

80 85 * 9C: 

Trp Arq Met Val Ttd Glu Tyr Asn Val Val 11c He Vai Mot Ala 
95 100 105 

5 Cys Arg Glu Phe Glu Met Gly Arg Lys Lys Cys Giu Arg Tyr Tro 

110 US 120 

Pro Leu Tyr Gly Glu Asp Pro He Thr Phe Ala fro Phe Lys He 
125 130 135 

Ser Cys Glu Asp Glu Gin Ala Arg Thr Asp Tyr Phe He Arg Thr 
10 140 145 IbC 

Leu Leu Leu Glu Phe Gin Asn Glu Ser Arg Arg Leu Tyr Gin Phe 
155 160 165 

His Tyr Val Asn Trp Pro Asp His Asp Val Pre Ser Ser Phe Asp 
17 0 17 5 ISO 

15 Ser He Leu Asp Met He Ser Leu Met: Arg Lys Tyr Gin Glu His 

185 190 1 

Glu Asp Val Pro He Cys lie His Cys Ser Ala Gly Cys Gly Arg 
200 205 210 

Thr Gly Ala lie Cys Ala He Asp Tyr Thr Trp Asn Leu Leu Lys 
20 215 220 225 

Ala Gly Lys He Pro Glu Glu Phe Asn Val Phe Asn Leu He Gin 
230 235 240 

Giu Met Arg Thr Gin Arg His Ser Ala Val Gin Thr Lys Glu Gin 
245 250 255 

25 Tyr Glu Leu Val His Arg Ala He Ala Gin Leu Phe Giu Lys Gin 

260 265 270 

Leu Gin 
272 

(2) INFORMATION FOR SEQ ID NO: 20: 

30 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

35 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:20: 

ACTTGGTGAG GAGCTTCTTG GAGCAGCTGG AGG 33 

(2) INFORMATION FOR SEQ ID NO : 2 1 : 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 24 base pairs 
40 (B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO:21: 
GGAATGTAAC CTGGAGGGTC CTGA 2 4 
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(2) INFORMATION FOR SEQ ID NO: 22: 

:i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 amino acids 

(B) TYPE: Amino Acid 
5 {C) TOPOLOGY : Linear 

<>:i) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 

Gly Phe Gly Asn Arg Phe Ser Lys Pro Lys Gly Pro Arc Asn Pro 
1 5 10 " 15 

Pro Ser Al a Trp 
10 19 

(2) INFORMATION FOR SEQ ID NO: 23: 

f.i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 amino acids 

(B) TYPE: Amino Acid 
15 (D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 3 : 

lie Gly Phe Gly Asn Arq Cys Gly Lys Pro Lys Gly Pro Arc- Asp 

1 ^ .'10 " lb 

Pro Pro Ser Glu Trp Thr 
20 20 21 
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Claims: 

1. An isolated non-receptor protein tyrosine phosphatase of hematopoietic stem cells (PTP I ISC), 

which 

(I) is expressed predominantly in early hematopoietic stem cells or progenitor cells; 
5 (2) predominantly lacks expression in adult tissues; 

(3) comprises an N-terminal tyrosine phosphatase domain, followed by a region rich in 
serine, threonine, and proline, and a carboxy terminal region of about 15 to 25 amino acids rich in basic amino 
acid residues; and 

(4) is capable of tyrosine dcphosphorylation in hematopoietic stem cells or progenitor 

10 cells. 

2. The PTP HSC of claim 1 which is murine. 

3. The PTP HSC of claim 1 which is human. 

4. The PTP HSC of claim 1 or a derivative thereof, which downregulates STAT activation. 

5. An antagonist of the PTP HSC of claim 1 . 
15 6. An antagonist of the PTP HSC of claim 4. 

7. An isolated non-receptor protein tyrosine phosphatase of hematopoietic stem cells (PTP HSC) 
selected from the group consisting of: 

( 1 ) a protein comprising the amino acid sequence shown in Figure I (SEQ. ID. NO:2); 

(2) a protein comprising the amino acid sequence shown in Figure 8 (SEQ. ID NO: 17); 
20 (3) a mammalian homologuc of protein ( 1 ) or protein (2); and 

(4) a derivative of proteins ( 1 ) - (2) retaining the ability of tyrosine dephosphorylation 
in hematopoietic stem cells or progenitor cells. 

8. The PTP HSC of claim 7 comprising an active N-terminal tyrosine phosphatase domain, 
retaining a serine residue at a position corresponding to amino acid position 37 in Figure I, a region rich in 

25 serine, threonine, and proline, retaining an active site cysteine residue at a position corresponding to amino acid 
position 229 in Figure 1, and a carboxy-terminal region showing at least about 80% sequence homology with 
the amino acid sequence between positions 430 and 45 1 in Figure I, said derivative having an at least about 65% 
overall sequence homology with the amino acid sequence shown in Figure I and retaining the ability of tyrosine 
dephosphorylation in hematopoietic progenitor cells. 
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9. The PTP HSC of claim 7, comprising the amino acid sequence shown in Figure 1 (SEQ. ID. 

NO: 2), or in Figure 8 (SEQ- ID. NO: 17). 

10. An antagonist of the PTP HSC of claim 7. 

11. An isolated nucleic acid molecule encoding the PTP HSC of claim I . 
5 12. An isolated nucleic acid molecule encoding the PTP HSC of claim 7, 

13. An isolated nucleic acid molecule encoding the PTP HSC of claim 1 1 . 

14. A vector comprising the nucleic acid molecule of claim 1 1 operably linked to control 
sequences recognized by a host cell transformed with the vector. 

15. A host cell transformed with the vector of claim 13. 

10 16. An antibody capable of specific binding to the PTP HSC of claim 7. 

1 7. A hybridoma cell line producing an antibody of claim 15. 

1 8. An assay for identifying an antagonist or agonist of a PTP HSC of claim I . which comprises 
contacting the phosphatase domain of said PTP HSC with a candidate antagonist or agonist, and monitoring 
the ability of said phosphatase domain to dephosphorylate tyrosine residues. 
15 19. An assay for identifying an antagonist or agonist of a PTP HSC of claim I , which comprises 

cultivating a PTP HSC-expressing hematopoietic stem or progenitor cell line in the presence of a candidate 
antagonist or agonist, and monitoring the differentiation of the stem or progenitor cells. 

20. A method for the differentiation of undifferentiated malignant hematopoietic cells, comprising 
contacting said cells with an antagonist of a PTP HSC according to claim 7. 
20 21. The method of claim 19 wherein said cells are leukemia cells. 

22. A method for the induction of differentiation of stem cells, comprising contacting said cells 
with an antagonist of a PTP HSC according to claim 7. 

23. A method for the expansion undifferentiated stems cells in cell culture, comprising cultivatinc 
stem cells in the presence of a PTP HSC according to claim 7 or an agonist antibody specifically binding a native 

25 PTP HSC. 

24. A method for the expansion of undifferentiated stem cells in vivo comprising administering 
to a patient an agonist of a PTP HSC according to claim 7 or an agonist antibody specifically bindinu a native 
PTP HSC, and a hematopoietic growth factor. 
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295 TLQD TSP HYQ NLK E NCA|PjIC 
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315KEAF d L R SHU A L d A IS [1 R i IB G 
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335 GVLR dlU VGE]A @ @ 0 L ©MA ; D [f) Y 
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355 A V V Q KRG A d A G E) G ED G gj R A [U (3 
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1261 
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U @ D @ |H I . Y 1 0 V A H R A QR[P]:VAH 
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395 © E D A QgSSaL RRVE ADQ N [S] [S] 



GGGCCTGATG CCTACGAAGA AGTAACAGAT GGAGCACAGA CTGGAG9^^^QG<p^^C 
G 0 D A YEE V S D GAQ@ G 6|j B lBKa 




^Q!S^^^^^^W^^^m§§i ACGGGTGTAA 

r v o 



1381 CGAGTGCTGT GCCAGTTATA GCCTGCCACT CGGTGGTGGC TGGACTCCTG GAACCACCAT 
1441 ACTGCTGTGC AGTGTGTTAT GTATGAGTGG GACTTGTGGG CCTGATTCAA AATAAAAGTT 
1501 TCTCAGGGCA GAAAAAAAAA AAAAAAAAA 
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GCGCGGGGCG GCCGGGAGGG GGCAGTCCTC GCCGGCGAGT T C AG CG AC AT 50 
CCAGGCCTGC TCGGCCGCCT GGAAGGCTGA CGGCGTGTGC TCCACCGTGG 100 
CCGGCAGTCG GCCAGAGAAC GTGAGGAAGA ACCGCTACAA AGACGTGCTG 150 
CCTTATGATC AGACGCGAGT AATCCTCTCC CTGCTCCAGG AAGAGGGACA 200 
CAGCGACTAC ATTAATGGCA ACTTCATCCG GGGCGTGG AT GGAAGCCTGG 2 50 
CCTACATTGC CACGCAAGGA CCCTTGCCTC ACACCCTGCT AGACTTCTGG 300 
AGACTGGTCT GGGAGTTTGG GGTCAAGGTG ATCCTGATGG CCTGTCGAGA 350 
GATAGAGAAT GGGCGGAAAA GGTGTGAGCG GTACTGGGCC CAGGAGCAGG 4 00 
AGCCACTGCA GACTGGGCTT TTCTGCATCA CTCTGATAAA GGAGAAGTGG 4 50 
CTGAATGAGG ACATCA 4 66 

FIG. 8A 
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